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(57) Abstract 



The present invention relates to 1 Icby, in particular 1 Icby polypeptides and 1 Icby polynucleotides, recombinant materials and methods 
for their production. In another aspect, the invention relates to methods for using such polypeptides and polynucleotides, including the 
treatment of bacterial, fungal, protozoan and viral infections, particularly infection caused by HTV-l or HIV-2; pain; cancers; diabetes; 
obesity; feeding and drinking abnormalities, such as anorexia and bulimia; asthma; Paridnson's disease; both acute and congestive heart 
failure; hypotension; hypertension; urinary retention; osteoporosis; angina pectoris; myocardial infarction; ulcers; allergies; benign prostatic 
hypertrophy and psychotic and neuroio^cal disorders, including anxiety, schizophrenia, manic depression, delkium, dementia or severe 
mental letanlation, and dyskinesias, such as Huntmgton*s disease or Gilles de la Tourett's syndrome, hereinafter referred to as "the Diseases", 
amongst others. In a still further aspect, the mvention relates to diagnostic and prognostic assays for detecting diseases associated with 
inappropriate 1 Icby activity or levels. A method of performing genetic association studies for searching a disease susceptibility and/or drug 
response genes comprising using pol3nno]phic markers in 1 Icby polynucleotides. Yet in further aspect, the invention relates to transgenic 
animals, and use of transgenic animals for disease models to screening for therapeutic. 
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11 cby Gaiomic Sequence 

Fidd of the Invention 

This invention overall relates to the field of human genetics. More particularly, this 
invention relates to newly identified polypeptides and polynucleotides encoding such 
5 polypeptides, to the use of polynucleotides in gene therapy, and to production of such 

polypq)tides and polynucleotides. In another aspect, this mvention relates to prognostic and 
diagnostic methods for human diseases. This invention also relates to a method of performmg 
genetic association studies for searching a disease susceptibility and/or drug response genes 
comprising using polymorphic markers. Yet in further aspect, the invention relates to transgenic 
10 animals, and use of transgenic animals for disease models to screening for therapeutic 
compounds. 

Background of tiie Invention 

The drug discoveiy process is currently undergoing a fundamental revolution as it embraces 
'functional g^mics', that is, hi^ throughput g^ome- or g^e-based biology. This approach is 
1 5 r^idly superceding earlier ^proaches based on 'positional cloning' . A phmotype, that is a bblogical 
function or genetic disease, would be identified and this would then be tracked back to the responsible 
gene, based on its genetic map position. 

Functional genomics relies heavily on the various tools of biomformatics to identify gent 
sequences of potential interest from the many molecular biology databases now available. There is a 
20 continuing need to id^itify and characterise further genes and their related polypq}tides/proteins, as 
targets for drug discoveiy. 

It is well established that many medically significant biological processes are mediated by 
proteins participating in signal transduction pa&ways that involve G-proteins and/or second 
messengers, e.g., cAMP (Lefkowitz, Nature, 1991, 351:353-354). Herein these proteins ate referred 
25 to as protems participating in pathways with G-proteins or PPG proteins. Some examples of these 
. proteins include the GPC recqptors, such as those for adrenergic agents and dopamine (Kobilka, B.K., 
et al., Proc. Natl Acad. Sci., USA, 1987, 84:46-50; KobUka, B.K., et al.. Science, 1987, 238:650- 
656; Bunzow, J.R., et al.. Nature, 1988, 336:783-787), G-proteins themselves, effector proteins, e.g., 
phospholipase C, adenyl cyclase, and phosphodiesterase, and actuator proteins, e.g., protein kinase A 
30 and protein kinase C (Simon, M.I., et al., Science, 1991, 252:802-8). 

For example, in one form of signal transduction, the effect of hormone binding is activation of 
the enzyme, adenylate cyclase, inside the cell. Enzyme activation by hormones is dependent on the 
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presence of the nudeotideGTP. GTP also influeiices honnone binding. A G-protein connects the 
hormone recq)tor to adenylate cyclase. G-protein was shown to ^cdiange GTP for bound GDP when 
activated by a hormone receptor. The GTP-canying form then binds to activated admylate cydase. 
Hydrolysis of GTP to GDP, catalyzed by the G-protein itsdf, returns the G-protein to its basal, 
5 inactive form. Thus, the G-protem serves a dual role, as an intermediate that rdays the signal from 
recq)tor to effector, and as a clock tiiat controls the duration of the signal. 

The membrane protein gene supalamily of G-protein coupled recq)tors has been characterized 
as having seven putative transmembrane domains. The domains are bdieved to represent 
transmembrane a4ielices connected by extracellular or cytoplasmic loops. G-protein coupled recq}tors 
1 0 indude a wide range of biologically active receptors, such as hormone, viral, growth factor and 
n^iroreceptors. 

G-protein coupled recq>tors (otherwise known as 7TM recq>tors) have been diaractaized as 
indudmg ftese seven conserved hydrophobic str^dies of about 20 to 30 amino acids, connecting at 
least eight divergent hydrophilic loops. The G-protem fiamily of coupled receptors indudes dopamine 

1 5 recqytors whidi bind to neurolq)tic drugs used for treating psydiotic and neurological disorders. 
Other examples of members of this family include, but are not lunited to, caldtonin, adren^gic, 
endothdin, cAMP, adenosine, muscarinic, acetyldioline, smtonin, histamine, thrombin, kinin, follide 
stimulating hormone, opsins, endothdial difier^tiadon gene-1, rhodopsins, odorant, and 
cytomegalovirus receptors. 

20 Most G-protein coupled recqytors have single conserved cysteine residues in eadi of the first two 

extracellular loops which form disulfide bonds that are bdieved to stabilize functional protein structure. 
The 7 transmembrane r^ons are designated as TMl, TM2, TM3, TM4, TM5, TM6, and TM7. 
TM3 has been implicated in signal transduction. 

Phosphorylation and lipidation ^almitylation or famesylation) of c^teine residues can influence signal 
25 transduction of some G-protein coupled receptors. Most G-protein coupled receptors contain pot^al 
phosphorylation sites within the third cytoplasmic loop and/or the carboxy termmus. For several G^ 
protein coupled receptors, sudi as the b-adrraorecq)tor, phosphorylation by protein kmase A and/or 
spedfic recq)tor kinases mediates receptor desensitization. 

For some receptors, the ligand binding sites of G-protein coupled receptors are believed to 
3 0 comprise hydrophilic sockets formed by several G-protein coupled recq)tor transmembrane domains, 
^said socket being surrounded by hydrophobic residues of the G-protein coupled receptors. The 
hydrophilic side of each G-protein coupled receptor transm^brane hdix is postulated to face inward 
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and fonn polar ligand binding site. TM3 has been implicated in several G-protein coupled recqptors as 
having a ligand binding site, sudi as the TM3 aspartate residue. TM5 serines, a TM6 asparagine and 
TM6 or TM7 phenylalanines or tyrosines are also implicated in ligand binding. 

G-protein coupled rec^tors can be intracellularly coupled by h^erotrimeric G-proteins to 
5 various intracellular enzymes, ion channels and transporters (see, Johnson et al., Endoc. Rev., 1989, 
10:317-331) Diffident G-protein a-subunits prefi^entially stimulate particular eEfectors to modulate 
various biological functions in a cell. Phosphorylation of cytoplasmic residues of G-prot^ coupled 
receptors have been id^fied as an important mechanism for the regulation of G-protein couplmg of 
some G-protem coupled recqptois. G-protein coupled recq)tors are found in numerous sites within a 
10 mammalian host. 

Over the past IS years, nearly 350 therq)eutic agents targetmg 7 transmembrane (7 TM) 
receptors have been successfully mtroduced onto the market. 

Summaiy of the Invention 

15 The present mv^tion relates to 1 Icby, in particular llcby polypeptides and llcby 

polynucleotides, recombmant matmals and methods for thdr production. In anoth^ aspect, the 
invendon rdates to methods for using such polypq>tides and polynucleotides, including the treatm^ 
of bacterial, fungal, protozoan and viral infections, particularly infiecdon caused by HIV-1 or HIV-2; 
pain; cancas; diabetes; obesity; feeding and drinking abnormalities, such as anorescia and bulunis^ 

20 asduna; Parkinson's disease; both acute and congestive heart failure; hypotension; hypertension; 
urinary retention; osteoporosis; angina pectoris; myocardial m&rcdon; ulcers; allergies; benign 
prostatic hypotrophy and psychotic and neurological disorders, mduding anxiety, schizophrenia, 
manic d^ression, delirium, dem^a or severe mental r^ardation, and dyskinesias, such as 
Huntington's disease or Gilles dela Tourett's syndrome , herdnafi^ rdierred to as "the Diseases", 

25 amongst odiers. In a still further aspect, the invention relates to diagnostic and prognostic assays for 
detecting diseases associated with mappropriate 1 Icby activity or levels. A method of performing 
genetic association studies for searching a disease susceptibility and/or drug response genes 
comprising using polymorphic markers in llcby polynucleotides is also subject of present invention. 
Further, the present invention relates to a method of providing an individual in need of llcby 

30 polypeptide comprising providing a 1 Icby polynucleotide through a gene therapy. Yet in further 
.^pect, the invention relates to transgenic animals, and use of transgenic animals for disease 
models to screening for therapeutic compounds. 
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Brief Description of the Drawings 

Fig. 1 is genomic sequence llcby. 
Description of the Invention 

In a first aspect^ the present invention relates to llcby polynucleotides. Such 
5 polynucleotides include a polynucleotide comprising the polynucleotide of SEQ ID NO: 1 . The 
nucleotide sequence of SEQ ID NO: 1 is a gDNA (genomic DNA) sequ^ce containing a sequence 
whidi encode the polypqjtide of the SEQ ID N0:2. Further 1 Icby polynucleotides of flie preset 
inv^tion include isolated polynucleotides comprising a nucleotide sequence which has at least 70% 
identity, preferably at least 80% identity, more preferably at least 90% identity, yet more 
1 0 preferably at least 95 % identity, to SEQ ID NO: 1 over flie entire length of SEQ ID NO: 1 , In this 
regard, polynucleotides which have at least 97% identity are highly prefiarred, whilst fliose wifli at least 
98-99% identiy are more highly prefared, and tiiose witii at least 99% identity are most highly 
preferred. Such polynucleotides include a polynucleotide of SEQ ID NO: 1. 

The llcby polynucleotides include a polynucleotide which hybridizes under stringent 
15 conditions to a polynucleotide of SEQ ID NO: 1 or to a fragment thereof, or to a complement of 
polynucleotide of SEQ ID N0:1 or to a fragment tiiereof. 

The llcby polynucleotides also include an isolated polynucleotide which comprises a 
polynucleotide which has at least 
(a) 70% identity; 
20 (b) 80% identity; 

(c) 90% identity; 

(d) 95% identity; 

(e) 97% identity; or 

(0 98% identity to a polynucleotide selected from the group consisting of 
25 a. SEQ ID N0:3; 

b. SEQ ID N0:5; and 

c. SEQ ID NO:7; 

over the entire lengfli of each SEQ ID N0S:3, 5 and 7, respectively. 

The 1 Icby polynucleotides also include a polynucleotide which hybridizes under stringent 
30 conditions to any of the polynucleotide of SEQ ID NO: 3, 5 or 7, or to a fragment thereof, or to 
-a.complement to any of the polynucleotide of SEQ ID NO:3, 5 or 7, or to a fragment thereof. 
The llcby polynucleotides also include an isolated polynucleotide which has at least 
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(a) 70% identity; 

(b) 80% identity; 

(c) 90% id^tity; 

(d) 95% idoitity; 

5 (e) 97% identity; or 

(f) 98% identityto a polynucleotide selected from the group consisting of 

a. SEQIDN0:4;and 

b. SEQIDNO:6: 

over the entire length of each SEQ ID NOS: 4 and 6, respectively. 

1 0 The 1 Icby polynucleotides include a polynucleotide which hybridizes under stringent 

conditions to any of the polynucleotide of SEQ ID N0:4 or 6, or to a fragment thereof, or to a 
complement to any of the polynucleotide of SEQ ID N0:4 or 6, or to a fragment thereof. 

A dq)osit containing a 1 Icby gDNA has be^ deposited with the Ammcan Culture 
Collection (ATCQ, University Boulevard, Manassas, VA 20110-2209, USA, on October 29, 1998, 

15 and assigned ATCC Deposit Number 98964. The dq)osited material ATG-1222, herdnafier ref^red 
to as the "deposited done" is DHIOB contaming pCYPACl that further contains tbie gDNA (genomic 
DNA). The nucleotide sequ^ce of &e polynucleotides contained in the deposited material, as weQ as 
the ammo acid sequence of the polypeptide encoded thereby, are controlling in the ev^t of any conflict 
witih any description of sequences herdn. The deposit has been made under the terms of the Budapest 

20 Treaty on the international recognition of the dq>osit of micro-organisms for purposes of patent 
procedure. The strain will be krevocably and without restriction or condition released to the public 
upon the issuance of a patent. The deposit is provided merely as convenience to those of skill in die art 
and is not an admission that a deposit is required for enablement, such as that required under 35 
U.S.C. §112. 

25 In a further aspea, Ilcby polynucleotides also include a polynculeotide which conprises a 

polynucleotide which has at least 70% identity, preferably, at least 80% identity, more preferably at 
least 90% id^tity, particularly at least 95% identity, to gDNA in the dq)osite!d done over the entire 
length of the gDNA, as well as a nucleotide sequence comprising a fragment of such gDNA insert. 
Furfliermore, those with at least 97% identity are highly preferred, those wifli at least 98-99% identity 

30 are most highly preferred, and those with at least 99% identity are most preferred. 

^ • The mvention also provides polynucleotides which are compl^nentaiy to all the 
aforemezitioned polynucleotides. 
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It will be readily appreciated that the deposited clone, or a firagmMit thereof, may also be used 
as a hybridization probe, for cDNA and genomic DNA, to isolate M-laigfli cDNAs and g^mic 
clones, as described dsewh^e. 

The llcby polynucleotides also include polynucleotides which are complementary to all 
5 the above described polynucleotides. 

In another aspect tiie present invention relates to llcby polypeptides, llcby polypeptides 
include polypeptides which are encoded by llcby polynucleotides. Particular preferred member 
of the llcby polypeptides is polypeptide of SEQ ID NO: 2. The polypeptide of SEQ ID NO: 2 is 
believed to be a member of the 7-transmembrane teceptoi family of polypeptides. It is therefore of 
10 mterest because is has now been discovered to bmd to 19 amino acid peptide melanin- 

concentratmg hormone (MCH) with amino acid sequence of H-Asp-Phe-Asp-Met-Leu-Arg-Cys- 
Met-Leu-Gly-Arg-Val-Tyr-Arg-Pro-Cys-Trp-Gln-Val-OH. SEQ ID NO: 8. Its properties are 
hereinafter referred to as "llcby activity" or "llcby polypeptide activity" or "biological activity 
of 1 Icby*" . 1 Icby polypeptide and its encoding cDNA are descrbed in EP 0848060 published June 
15 17, 1998. 

The polypeptides of the present invention may be in the form of the "mature" protein or 
may be a part of a larger protein such as a fusion protein. It is often advantageous to include an 
additional amino acid sequence which contains secretory or leader sequences, pro-sequences, 
sequences which aid in purification such as multiple histidme residues, or an additional sequence 

20 for stability during recombinant production. 

The present invention also includes include variants of the aforementioned polypetides, fliat is 
polypeptides that vary from the referents by conservative amino acid substitutions, whereby a residue is 
substituted by another wifli like characteristics. Typical such substitutions are among Ala, Val, Leu 
and Be; among Ser and Thr; among the acidic residues Asp and Glu; among Asn and Ghi; and among 

25 the basic residues Lys and Arg; or aromatic residues Phe and Tyr, Particularly prrferred are variants 
in which sev«^, e.g. 1-5, 5-10, 10-20, 20-50, etc. amino acids are substituted, deleted, or added in 
any combination. 

Polypeptides of the present invention can be prepared in any suitable manner. Such 
polypq)tides include isolated naturally occurring polypq)tides, recombinantiy produced polypq)tides, 
3 0 synthetically produced polypeptides, or polypeptides produced by a combination of these m^ods. 
—Means for prq>aring such polypeptides are wdl understood in the art. 
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Prefixed polypeptides and polynucleotides of the present inv^tion are expected to have, inter 
aUa, similar biological functions/properties to tiieir homologous polypeptides and polynucleotides. 
Furthermore, preferred polypeptides and polynucleotides of tiie present invaition have at least one 
llcby activity. 

S Polynucleotides of the present invention may be obtained, using standard doning and 

screening techniques, from a gDNA library derived from cells of human whole brain library. 
Polynucleotides of the invention can also be synthesized using well known and commercially 
available techniques. 

When polynucleotides of the present invention are used for the recombinant production of 
1 0 polypeptides of the present invention, the polynucleotide may be the coding sequence for the mature 
polypq>tide, by itself; or the coding sequ^ce for the mature polypeptide in reading fi:ame with other 
coding sequraces, such as those encoding a leader or secretory sequence, a pre-, or pro- or prq>ro- 
protein sequence, or otiier fusion peptide portions. For exanq)le, a marker sequence whidi facilitates 
purification of the fiised polypq>tide can be ^coded. In certain preferred embodiments of this aspect 
IS of the invention, Ae marker sequence is a hexa-histidine peptide, as provided in the pQE vector 

(Qiagen, Inc.) and described in Gentz et oL , Proc Natl Acad Sd USA (1989) 86:821-824, or is an HA 
t^g. The polynucleotide may also retain non-coding 5* and 3* sequences, such as transcribed, non- 
translated sequ^ces, splicing and polyadenylation signals, ribosome binding sites and sequences tiiat 
stabilize mlRNA. 

20 Further embodiments of the present invention include polynucleotides variants which con^rise 

the polynucleotide of SEQ ID NO: 1 in which 200 to 100, 100 to 50, 50 to 10,5to 10. 1 to 5, lto3, 
1 to 2 or 1, nucleotide residues are substituted, ddeted or added, in any combination. 

Polynucleotides whidi are identical or sufBciently identical to a nudeotide sequence contained 
in SEQ ID NO: 1, may be used as hybridization probes for cDNA and genomic DNA, or as primers 

25 for a nuddc acid amplification (PGR) reaction, to isolate full-length cDNAs and g^mic clones 

encodmg polypq)tides of flie present invention, and to isolate cDNA and genomic dones of other g^ies 
Cuodudmg genes encoding homologs, paralogs, and orthologs from spedes otiier tiian human) that have 
a hig}i sequence simflarity to SEQ ID NO: 1 . l^ically tiiese nudeotide sequences are 70% identical, 
preferably 80% identical, more preferably 90% identical, most preferably 95% identical to that of tiie 

3 0 referent. Even more prefiarably , they are 97-99 % identical to the referent. The probes or primers will 

g^i^ly coiqprise at least 15 nudeotides, pref^ably, at least 30 nudeotides and rosy have at least 50 

nudeotides. Particularly preferred probes will have between 30 and 50 nucleotides. 
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A polynucleotide encoding a polypeptide of the present inv^tion, including homologs, 
paralogs, and orthologs from species other than human, may be obtained by a process which coiiq)rises 
the steps of screening an appropriate library under appropriate hybridization conditions altering for 
^cample, salt concentration, ten^^ature, washing times and buffers with a labeled probe having die 
5 sequttice of SEQ ID NO: 1 or a fragment thereof; and isolating full-length cDNA and genomic dones 
containing said polynucleotide sequence. Sudi hybridization tedmiques are well known to the skilled 
artisan. Pr^srred appropriate hybridization conditions are ^ven for example m Chruch, G.M. and 
GilbMt, W (1984) Genomic sequencing. Proc. Nafl. Acad. Sci. USA 81:1991-1995. Qneprefiaied 
condition indudes overnight incubation at 42^0 in a solution comprismg: 50% formamide, 5xSSC 

1 0 (15QmM NaQ, 15mM trisodium dtrate), 50 mM sodium phosphate ^H7.6), 5x Derihardt's solution, 
10 % d^an sulfate, and 20 microgram/ml draiatured, sheared sahnon sperm DNA; followed by 
washing fiie filters in 0. Ix SSC at about 65^0. Thus the present invention also includes 
polynucleotides obtainable by scre^g an ^propriate library under stilngent hybridization conditions 
with a labded probe having the sequence ofSEQ ID NO: 1 or a fragment theceof, or with a labded 

15 probe having the con^ilementaxy sequence to SEQID N0:1 or a fragment thereof. 

Recombinant polypqrtides of the present invention may be prepared by processes wdl known 
in the art from genetically engme^:ed host cells comprising expression systems. Accordingly, in a 
further aspect, tiie present invration rdates to e?q)ression syst^ns whidi comprise a polynucleotide or 
polynudeotides of the present inv^on, to host cdls whidi are genetically engme^ed with sudi 

20 expression sytems and to &e production of polypeptides of the invention by recombmant tedmiques. 
Cell-free translation syst^ns can also be employed to produce sudi protems usmg RNAs derived from 
the DNA constructs of the pres^ invention. 

For recombinant production, host cdls can be genetically engineered to incorporate e3q)ression 
systems con^rising polynudeotides of the present inv^on or a portion fiiereof . Introduction of 

25 polynudeotides into host cdls can be effected by methods described in many standard laboratoiy 
. manuals, such as Davis et al. , Basic M^ods in Molecular Biology (1986) and Sambrook et al.. 
Molecular aoning: A Laboratory Manual, 2nd Ed., Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor, N.Y. (1989). Prefisired sudi mediodsinchide, for instance, cddum phosphate 
transfection, DEAE-dextran mediated transfection, transvection, microinjection, cationic lipid-mediated 

3 0 transfection, elec&oporation, transduction, scrspe loading, ballistic introduction or infection. 
— Representative examples of s^propriate hosts indude bacterial cdls, sudi as strqnooKd^ 
stcpf^Iococd, K coU, Strq>tomyces and Badllus subtiUs cdls; fungal cdls, such as yeast cdls and 
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Aspergillus cells; insect cells such as Drosophila S2 and Spodoptera Sf9 cells; animal cdls sudi as 
CHO, CX)S, HeLa, C127, 3T3, BHK, HEK 293 and Bowes melanoma cells; and plant cells. 

A great variety of expression systems can be used, for instance, chromosomal, episomal and 
virus-derived systems, e.g., vectors d^ved from bacterial plasmids, from bacteriophage, from 
5 transposons, from yeast q)isomes, from insertion elements, from yeast dtromosomal dements, fix)m 
viruses such as baculoviruses, pg^va viruses, sudi as SV40, vacdnia viruses, adenoviruses, fowl pox 
viruses, pseudorabies vkuses and retroviruses, and vectors derived from combinations thereof, such as 
those derived from plasmid and bacteriophage genetic elements, such as cosmids and phagemids. The 
es^ression systems may contain control regions diat regulate as well as engender expression. 

1 0 GCTerally , any system or vector whidi is able to maintain, propagate or express a polynucleotide to 
produce a polyp^tide m a host may be used. The appropriate nucleotide segurace m^ be inserted 
into an e3q)ression system by any of a variety of well-known and routine techniques, sudi as, for 
example, those set forth in Sambrook et al,, MOLECULAR CLONING, A LABORATORY MANUAL 
{stq^rd). Appropriate seoredon signals may be mcoiporated into die desu^ polyp^tide to allow 

1 5 secredon of die translated protem mto die lumen of the sadoplasnuc reticulum, die p^lasmic space or 
the extracellular environment. Hiese signals may be endogenous to the polypq)tide or they may be 
heterologous signals. 

If a polypq)dde of the present invention is to be expressed for use in agonist/antagonist 
screaiing assays, it is generally prefored that the polypeptide be produced at the surfece of the cell. 
20 In this event, the cells may be harvested prior to use in the screening assay. If the polypeptide is 
secreted into the medium, the medium can be recovered in order to recover and purify the 
polypeptide. If produced intracellularly, the cells must first be lysed before the polypeptide is 
recovered. 

Polypeptides of the present invention can be recovered and purified from recombinant cdl 
25 cultures by wdl-known mefliods iucluding ammonium sulfate or ethanol precipitation, acid extraction, 
anion or cation exchange chromatography, phosphocellulose diromatogr^hy, hydrophobic interaction 
diromatogr^hy, afifinity chromatography, hydro^lapatite chromatography and lectin diromatography. 
Most preferably, high performance liquid chromatography is employed for purification. Well known 
techniques for refolding proteins may be employed to regenerate active conformation when the 
30 polyp^tide is denatured during isolation and or purification. 

The modern devdopment of molecdar biologicd techniques has pOTnitted investigation mto 
the genetic abnormalities that cause, or corrdate with, specific human disease states and conditions. 
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For exsanple, restriction fragment lengfli polymorphism (RFLP) analysis facilitates the identification of 
genetic defects which cause or are correlated with disease states, and facilitates the id^tification of 
individuals possessing the genetic defects. RFLP procedures involve digesting DNA wifli one or more 
restriction enzymes and analyzing the restriction fragments using, e.g., Soufliem blot hybridizations 
5 employing selected gene probes. See Alberts et al.. Molecular Biology of the Cell, Second Edition, 
New York:Garland Publishing (1989), pp. 270-71. 

The polymerase diain reaction (PGR) and its many variations are particularly useful tools for 
investigation into g^etic abnormalities that underlie disease states and conditions. (See, e.g., Erlich et 
al., Current Communications in Molecular Biology: Polymerase Chain Reaction, Cold Spring 

10 Harbor:Cold Spring Harbor Press (1989); Innis et al., PCT Protocols: A Guide to Mefliods and 
Applications. San Diego:Academic Press (1990).) PGR is used to an5)lify a DNA or one or more 
portions thereof that are of particular interest, to facilitate fimfaer characterization of the amplified 
portion. Such farOier characterization includes gel dectrophoresis to determine size, nudeotide 
sequencmg, hybridization studies using particular probes, and the like. See generally, Sambrook et al., 

1 5 Molecular Qoning-A Laboratory Manual, Second Edition, Cold Spring Harbor:Cold Spring Harbor 
Press (1989). 

This invention also relates to the use of polynucleotides of the present invention as diagnostic 
reagents. Individuals with the wild-type llcby gene do not have Diseases which result from the llcby 
allele. Detection of a mutated form of the gene will provide a diagnostic tool for a Disease, to 
20 prognose of a Disease, to det^mine the susceptibility to a Disease, to monitor a Disease progression, 
or to profile a Disease, As used herein a Disease refers to any of the Diseases defined earlier. Thus 
the present invention relates to a method of detecting variations in llcby alldes among the 
population comprising comparing the 1 Icby gene sequences of the population. In further aspect, this 
invention also relates to a method of detecting a presence of or an absense of variation in a llcby 
25 allele in an individual from that of SEQ ID NO: 1, comprising comparing a llcby allele sequence 
of tiie individual with tiiat of SEQ ID NO: 1. 

In anolh^ aspect, the invention provides a mediod of screening an individual for an increased 
risk of developing a Disease comprising the steps of: 

(a) assaying for a presence or abs^ce of a 1 Icby polynucleotide mutation relative to 1 Icby 
30 polynucleotide in a normal individual; and 

— (b) correlating ihe presence of a mutation to an increased risk of devdopmg a Disease, and 
absence of mutation to no increased risk of developing a Disease. 
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Yet in another aspect, the invention provides a method for diagnosing a Disease in an 
individual comprising: 

(a) assaymg for a presence or absence of a 1 Icby polynucleotide mutation relative to 1 Icby 
polynucleotide in a normal individual; and 
5 (b) corrdating the presence of a mutation to a presence of a Disease, and absence of mutation 

to an absence of a Disease. 

Individuals carrying mutations in the graie may be detected at the DNA levd by a variety of 
tedmiques. Nudeic acids for diagnosis may be obtained from, for example, a subject's cdls, blood, 
urine, stool, sputum, hair, vaginal swabs, semen, saliva, tissue biopsy or autopsy material. Useful 

1 0 diagnostic tedmiques indude, but are not limited to; fluorescent in situ hybridization (FISH), direct 
DNA sequencing, PFGE analysis (SSCA), RNase protection assay, allde-specific oligonudeotide 
(ASO), dot blot analysis and PCR-SSCP, denaturing HPLC as discussed in d^ fiirflier below. 

Direct DNA sequencing, eithCT manual sequencmg or automated fluorescent sequencing can 
detect sequence variation. Another approadi is the single-stranded conformation polymorphism assay 

15 (SSCA or SSCP) (Orita et al., Proc. Natl. Acad. Sd. USA 86:2776-2770, 1989). This method does 
not detect all sequence dianges, especially if the DNA fragment size is greater than 200 bp, but can be 
q)itomized to detect most DNA sequence variation. The reduced detection sensitivity is a 
disadvantage, but the increased throughput possible with SSCA makes it an attractive, viable alternative 
to direct sequencing for mutation detection. The fragments which have shifted mobility on SSCA gels 

20 are th^ sequenced to determine the exact nature of the DNA sequence variation. Other approaches 
based on the detection of mismatches between the two complementary DNA strands indude clamped 
denaturing gd dectrophoresis (CDGE) (Sheffield, et al., Am. J. Hum. Genet. 49:699-706,1991), 
hetax)duplex analysis OEIA) (White et al.. Genomics 12:301-306,1992) and chemical mismatdi 
deavage (CMQ (Grompe et al., Proc. Natl. Acad. Sd. USA 86:5855-5892,1989) Denaturing HPLC, 

25 similar to SSCP, can be used which uses a PCR amplified product, run down a heated HPLC column; 
the heating disassociates the DNA strands and one usually sees two peaks if th^e is a variation in the 
alldic forms. It works on the same principles as SSCP. Blind analysis of denaturing high-performance 
liquid duomatography as a tool for mutation detection. O'Donovan MC, Oefiier PJ, Roberts SC, 
Austin J, Hoogendoom B, Guy C, Speight G, Upadhyaya M, Sommer SS, McGuffin P., Genomics 

30 1998 Aug 15:52(1):44^. 

A rapid prdiminary analysis to detect polymorphisms in DNA sequaices can also be 
performed looking ^ a series of Southmi blots of DNA cut with one or more restriction em^es. 
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preS^ably with a large number of restriction enzymes. Each blot contains a series of normal 
individuals and a series of diseased cases, or both. Southern blots displaying hybridizing fragments 
differing in length from control DNA when probed widi sequences near or including the llcby locus 
uidicate a possible mutation. If restriction enzymes which produce very large restriction fragmeorts are 
5 used, then pulsed fidd gd dectrophoresis (PFCT) is en?)loyed. 

A review of currently available methods of d^ecting DNA sequence variation can be found in 
a recent review by Grompe (Gtompe, Nature Generics 5: 1 1 1-1 17, 1993). Once a mutation is known, 
an allde specific detection q)proach sudi as allde spedfic oligonudeotide (ASO) hybridization can be 
utilized to rapidly screen large number of other samples for the same mutation. Among the most wdl 

1 0 known mediods for confirming die pres^ce of a suscq}tibility allde, are sudi as: 1) single stranded 
confonnation analysis (SSCA) (Qrita et al., Proc. Natl. Acad. Sd. USA 86:2776-2770, 1989); 2) 
denaturing gradirat gd dectrophoresis (DGGE) (Wartdl et al., Nucl. Adds. Res. 18:2699-2705, 
1990; Shieffield et d., Proc. Natl. Acad. Sd. USA 86:232-236, 1989); 3) RNase protection zssays 
O^hikdstdn et d.. Genomics 7:167-172, 1990; Kinszler et d., Sdence 251: 1366-1370,1991); 4) 

15 dlde^pecific oligonudeotides (ASOs) (Conner et d, Proc. Nafl. Acad. ScL USA 80:278-282, 1983); 
5) the use of proteins whidi recognize nucleotide mismatches, sudi as the E. coli routs protdn 
(Modridi, Ann. Rev. Gen^ 25:229-253,1991); and 6) allde^specific PGR (Rano &iadd, Nud. 
Adds. Res. 17:8392, 1989). The present invention is not limited to a particular method but aiQr and 
dl mediods whidi can be used to identity mutations. 

20 in the first three m^ods (SSCA, DGGE and RNase protection ass^), a new dectrpphoretic 

band qipears. SSCA detects a band whidi migrates differ^itidly because the sequ^ce diange causes a 
difference in sii^enStrand, intennolecdar base pairing. RNase protection involves deavage of die 
mutant polynudeotide into two or more smaller firagments. DGGE d^ects differences in migration 
rates of mutant sequences compzted to wild-type sequences, using a denaturing gradient gd. In an 

25 dlde-spedfic oligonudeotide ass^, an oligonudeotide is designed whidi detects a specific sequence, 
and the assff^ is perfioimed by detecting a specific sequence, and the assay is performed by d^ecting 
the presence or absoice of a hybridization signd. In die mutS assay, the protein binds only to 
sequences that contam a nudeotide mismatch in a heteroduplex betweoi mutant and wild-type 
sequences. 

30 Mismatches, according to the present invention, are hybridized nudeic acid duplexes in which 

_die two strands are not 100% complementary. Lack of totd homology may be due to ddetion, 
insertions, inversions or substitutions. Mismatdi detection can be used to detect point mutations in the 
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gene. While these techniques are less sensitive than sequencing, they are sinipler to perform on a 
large number of samples. An exanq)le of a mismatch cleavage tedmique is the RNase protection 
method. In the practice of the present invention, the method involves the use of a labeled riboprobe 
which is complem^itary to the human wild-type 1 Icby g^e coding sequence. The riboprobe and 
S eitfa^ roRNA or DNA isolated from the sanq)le are annealed (hybridized) togeth^ and subsequendy 
digested with the enzyme RNase A whidi is able to detect some mismatches in a duplex RNA 
structure. Amisnrntdiisd^ectedlryRNaseA, itcleavesatthesiteoftfaeinismatc^ Thus, whra the 
annealed KNA prq)aration is separated on an electrophoretic gel matrix, if a mismatch has been 
detected and cleaved by RNase A, an RNA product will be seen whidi is smaller than the full length 

1 0 duplex RNA for the riboprobe and the niRNA or DNA. The riboprobe need not be the full lengdi of 
the llcby mRNA or gent but can be a segment of ^er. If Ae riboprobe comprises only a s^ment of 
the llcby mRNA or gene, it will be desirable to use a number of &ese probes to screen die whole 
mRNA sequence for mismatdies. 

In similar fashion, DNA probes can be used to detect mismatdies, through enzymatic or 

IS ch^cal deavage. See e.g.. Cotton et al., Proc. Natl. Acad. Sci. USA 85:4397, 1988; Shenk et al., 
Proc. Natl. Acad. Sci. USA 72:989, 1975; Novadc et al., Proc. Nafl. Acad. Sd. USA 83:586, 1986. 
Alternatively, mismatches can be detected by shifts in die dectrophor^c mobility of mismatdied 
duplexes rdative to matdied duplexes. See, e.g., Caridlo, Human Gaieties 42:726, 1988. Widi 
either riboprobes or DNA probes, the cellular mRNA or DNA whidi mi^t contam a mutation can be 

20 anq>lified usmg PGR before hybridization. Changes in DNA of the 1 Icby gene can also be detected 
using Southern faybridizadon, especially if the changes are gross rearrang^nents, such as ddetions and 
insertions. 

For allde-specific PGR, primers are used whidi hybridize at Ifaeir 3* etKls to a particular 1 icby 
mutatbn. If the particular lld)y mutation is not present, an anqdification product is not observed. 
25 Anq>Iification Refractory Mutation System (ARMS) can also be used as disdosed in European Patent 
Application Publication No. 0332435 and in Newton et al. ^ewton et al., Nucl. Acids Res. 17:2503- 
2516, 1989). By use of a batt^ of such allde-specific probes, PGR amplification products can be 
saeened to idaitify the presence of a previously identified mutation in the 1 Icby g^e. 

The primer pairs can be made which are useful for determination of the nucleotide sequence of 
30 a particular 1 Icby allde using PGR. The pairs of single-stranded DNA primers can be annealed to 
^.^sequences witfiin or surrounding the llcby gene on chromosome 22ql3 in order to prime amplifying 
DNA syndesis of die llcby gene itsdf. A con5)lete set of diese primers allows synfliesis of all of the 
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nudeotides of the llcby gene coding sequences, i.e., the exons. The set of primers also pref^-ably 
allows qnatfaesis of both intron and exon sequences. As described above, allde-specific primers can 
also be used. Such primal anneal only to particular 1 Icby mutant alldes, and thus will only amplify a 
product in the presence of the mutant allele as a template. 

5 In order to facilitate subsequent doning of amplified sequ^ces, primer may have restriction 

Misyme site sequences appended to their 5* ends. Thus, all nucleotides of the primers are derived from 
sequences wifliin or adjacent to 1 Idjy gene, except for die few nucleotides necessary to form a 
restriction enagmie site. Such emsymes and sites are well known in the art. The primers themsdves 
can be synfliesized using tedmiques whidi are wdl known in the art. Generally, the primers can be 

1 0 made using oligonudeotide synthesizing machines which are commercially available. Ghren the 
sequence shown in SEQ ID N0:1, design of particular primers is wdl within the skill of die art. 

The nudeic acid probes/fragmaits provided by the present mvention are usefiil for a number 
of purposes. They can be used m SouthOTi hybridizadon to genomic DNA and m the RNase 
protection mefliod for d^ecdng point mutations already discussed above. The probes can be used to 

1 5 detect mismtadies with die 1 Icby gene or mRNA using other tedmiques. They can be used as probes 
in Norfliem blotting to delect die expression of die 11 cby gaie product or in the probing of micracrays 
of polynucleotides on a grid to look for rdated g^es or as a member of sudi a grid to test for gene 
expression of many genes. Lockhart DJ, Dong H, Byrne MC, FoUeltie MT, Gallo MV, Chee MS, 
Mittmann M, Wang C, Kobayashi M, Horton H, Brown EL Expression mcmtortng by hybridization to 

20 high-density oligonucleotide arrays, 

Nat Biotedmol 1996 Dec; 14(13): 1675-80 Wodicka L, Dong H, Mittmann M, Ho MH, Lockhart DJ 
Genome-wide egression monitoring in Saccharoniyces cerevisiae, Nat Biotechnol 1997 
Dec;15(13): 1359-67. Also they can be used to probe gene diip arrays looking for variants between 
individuals as a method for polymorphism discovery or diagnosdcally. Cronin MT, Fucini RV, Khn 

25 SM, Masmo RS, Wespi RM, Miyada CG Cystic fibrosis mutation detection by hybridizadon to light- 
generated DNA probe arrays. Hum Mutat 1996;7(3):244-55. 

The present invention also rdates to genetic markers. Got^c markers are essential for linking 
a disease to a r^ion of a chromosome. Such markers include restriction fragment length polymorphism 
(RFLPs) (Botstein et al.. Am. J Hum. Genet. 32:314-331, 1980), markers wifli a variable number of 

30 tandem repeats (VNTRs) (Jeffreys et al., Nature, 314:67-73,1985; Nakamura el al., Science 235:1616- 
^622, 1987), and an abundant dass of DNA polymorphisms on short tandem repeats (STRs), 
espedany repeats of CjpA (Weber and May, Am. J. Hum. Genet. 44:388-396, 1989; Lilt et al.. Am, J. 
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Hum. Genet. 44:397-401, 1989) and single nucleotide polymorphisms (SNPs), Large-scale 
identification, mapping, and g^otyping of single-nucleotide polymoiphisms in the human genome, 
Wang DG, Fan JB, Siao O, Bemo A, Young P, Sapolsky R, Ghandour G, P^kins N, Winchester E, 
Spaica: J, Kruglyak L, Stein L, Hsie L, Topaloglou T, Hubbdl E, Robinson E, Mittmann M, Morris 
5 MS, Shen N, Kilbum D, Rioux J, Nusbaum C, Rozen S, Hudson TJ, Lander ES, et al„ Science 1998 
May 15;280 (5366): 1077-82. To generate a genetic map, one selects potential genetic markers and 
test them using DNA extracted from members of the kindreds being studied. Hie polynucleotide 
of SEQ ID NO: 1 may contain polymorphic markers, and are therefore suitable for genetic 
association studies searching for disease susceptibility and/or tfa^apeutic (e.g. drug) response 

10 genes. Thus in another aspect, die invention provides a method of performing genetic association 
studies for searching a disease susceptibility genes or individual's likelihood of responding to a 
particular therapeutic intervention, comprising using polymorphic markers in the polynucleotide 
of SEQ ID NO: 1 and associating with a disease suscq)tibility genes or widi likelihood of 
respondmg to a particular therapeutic intervention to the polymorphic markers. 

15 &i the present method, the phrase "screening a human subject for a risk of devdoping a 

Disease*' mieans screening a human subject for the pres^ce or absence of a genetic mark^ (l.e., a 
1 Icby gene mutation), the presence of which corrdates widi an increased potential for devdoping the 
Disease. By "increased potential for devdoping a Disease" is meant a greats likelihood (|)robability) of 
devdoping the Disease than would odierwise be predicted (statistically), e.g. from the prevalence of the 

20 disorder in the general population or in the subject's ^fanic population, the individual's age, sex, and 
the like. A person (e.g., a medical practitioner or odier health care practition^) can estimate a 
probability ttiat ai^ particular individual will devdop a Disease flirougfa scteemig a human subject for 
a risk of devdoping a Disease. 

In another aspect, the invention provides a kit for screening a human subject for an 

25 increased potoitial of developing a Disease, the kit comprising: an assay means for assaying a 
sample of genomic DNA from a human subject for the presence of an llcby gene mutation, 
wherein the assay means produces a positive llcby gene mutation assay result if llcby gene 
mutation is present in the DNA of the subject, and the assay means produces a negative assay 
result if the DNA of the subject is free of the llcby gene mutation; and a means for corrdating a 

30 positive llcby mutation assay result to an increased potential of developing a Disease. 

' Exemplary assay means include materials for performing molecular biological techniques 

to detennine the presence of an llcby gene mutation. Thus, the assay means may include DNA 
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sequencing materials, PCR materials, restriction digesdon/southem hybridization materials, and 
combinations thereof. 

By a means for corrdating a positive 1 Icby gene mutation assay result to an increased 
potential of developing a Disease is meant a chart, table, graph, te^ct, or other ref(^ence that permits a 
S medical practitioner to corrdate assay results for a particular human subject to statistical risk for 
developing a particular Disease. For example, the chart, table, graph, or text is based on assay data 
disclosed hereia, or upon further assay data obtained according to the teachings herem. The chart, 
table, graph, or text, or the data contained therein, may be provided in computer-readable form (e.g., 
on a floppy disc or compact disc). 
10 Thus in another aspect, the present invention relates to a diagonostic kit which comprises: 

(a) a polynucleotide of the present invention, preferably the nucleotide sequence of SEQ ID NO: 
1, or a fragment thereof ; or 

(b) a nucleotide sequence complemmtary to diat of (a); 

It will be appreciated that in any such kit, (a) or (b) may comprise a substantial 

1 5 compon^t. Sudi a kit will be of use in diagnosmg a disease or suspectability to a disease, 

particularly bacterial, fungal, protozoan and viral mfecdons, particularly infection caused }yy HIV-1 or 
HIV-2; pain; cancers; diabetes; obesity; feeding and drinking abnormalities, such as anorexia and 
bulimia; asthma; Parkinson's disease; both acute and congestive heart Mure; hypotension; 
hypertension; urinary retention; osteoporosis; angina pectoris; myocardial infarction; ulc^; all^gi^; 

20 b^gn prostatic hypertrophy and psychotic and neurological disorders, including anxiety, 

sdiizophrenia, manic dq)ression, delirium, dem^tia or severe mental retardation, and dyskinesias, 
such as Huntington's disease or Gilles dela Tourett's syndrome, amongst others. 

In a related aspect, the invention provides a genetic counseling method comprismg the 
steps of: (a) isolating genomic DNA from a human individual; (b) assaying the DNA for a llcby 

25 gene mutation relative to DNA of a normal human subject; (c) determining a potential for 

developing a Disease in the human individual, wherein the presence of the llcby gene mutation in 
the genomic DNA of the human individual is indicative of an increased potential for developing 
the Disease; and (d) advising the human individual with respect to the individual's potential for 
developing the Disease. By "advising" the human individual is meant, e.g., providing the 

30 mdividual with the assay results and with the interpretation of those results (i.e., eitiier normal or 
^ increased potential for developing a Disease). 
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The nucleotide sequences of the present invention are also valuable for diromosome 
identification. The sequence is specifically targeted to, and can hybridize with, a particular location on 
an individual human chromosome. Hie moping of relevant sequences to diromosomes according to 
the present invention is an important first step in corrdating those sequences with gene associated 

5 disease. Once a sequence has been mapped to a precise diromosomal location, the physical position of 
the sequence on the chromosome can be correlated witii genetic map data. Such data are found in, for 
example, V. McKusick, Mraddian Inheritance in Man (available on-lme through Johns Hopkins 
UnivCTsity Wddi Medical Library), Tlie relationship between genes and diseases that have been 
m25)ped to the same chromosomal region are then identified throu^ linkage analysis (coinheritance of 

1 0 physically adjac^t genes) or direct detection of mutations m the gene itsdf . 

The geoe of the present invention m^s to human chromosome 22ql3. Hie diromosomal 
location which contains llcby gene is called llcby locus. An mdividual carries two homologous 
diromosomes, thus an individual possess two lld)y alleles. 

Another aspect of the invention relates to a method for inducmg an immunological 

15 response in a mflmmal which comprises inoculating the mammal with a polypeptide of the present 
invention, adequate to produce antibody and/or T cdl immune response to protect said animal 
from the Diseases herembefore mentioned, amongst others. Yet another aspect of die invention 
relates to a mefliod of inducing immunological response in a mammal which comprises, ddivering 
a polypeptide of the present invention via a vector directmg expression of the polynudeotide of 

20 the present invention and coding for the polypeptide in vivo in order to induce such an 
immunological response to produce antibody to protect said animal from Diseases. 

In a forflier aspect, the preset invention provides methods of treating abnormal conditions 
sudi as, for instance, bacterial, fungal, protozoan and viral infections, particularly infection caused by 
HTV-l or HIV-2; pain; cancers; diabetes; obesity; fiseding and drinking abnormalities, sudi as anorexia 

25 and bulimia; asthma; Parkinson's disease; both acute and congestive heart failure; hypotaasion; 

hypertensbn; urinary retention; osteoporosis; angina pectoris; myocardial infarction; ulc^s; allergies; 
benign prostatic hypertrophy and psychotic and neurological disorders, including anxiety, 
sdiizophrraia, manic depression, delirium, dementia or severe mental retardation, and dyskinesias, 
sudi as Huntington's disease or Gilles dda Tourett's syndrome, rdated to either an excess of, or an 

3 0 under-^q)ression of, 1 Icby polypeptide activity, 

_ ^ ' If the activity of the polypq)tide is in excess, several approadies are available. One approach 
coxi5)rises inhibiting expression of the gene encoding endogenous llcby polypq>tide. Known such 



wo 00/37113 



PCT/US99/30622 



18 

techniques involve the use of antisense sequences, either internally generated or separately 
administered (see, for example, O'Connor, JNeurochem (1991) 56:560 in Oligodeoxynucleotides 
as Antisense Inhibitors of Gene Expression, CRC Press, Boca Raton, FL (1988)). Alternatively, 
oligonucleotides which form triple helices with the gene can be supplied (see, for example, Lee et 
5 al. , Nucleic Adds Res (1979) 6:3073; Cooney et al. , Science (1988) 241 :456; Dervan et al , 
Science (1991) 251:1360). These oligomers can be administered per se or Ae relevant oligomers 
can be expressed in vivo. 

For treating abnormal conditions rdated to an under-€?q)ression of 1 Icby and its activity, 
several approaches are also available. One approach involves gene therapy to effect the endogenous 

10 production of Ucby by the relevant cells in the subject. More specifically, according to the 

present invention, a method is also provided of supplying wild-type llcby gene function to a cell 
which carries mutant 1 Icby alleles. Hie wild-type 1 Icby gene or a part of the gene m^ be 
introduced into the cell in a vector such that the gene remains extrachromosomal. In such a 
situation, the gene will be e)q)ressed by the cell from the extrachromosomal location. More 

15 preferred is the situation where the wild-type llcby gene or a part thereof is introduced into the 
mutant cell in such a way that it recombines with the endogenous mutant llcby gene present in 
die cell. Such recombination requires a double recombination event which results in the 
correction of the 1 Icby gene mutation. Vectors for mtroduction of genes both for recombination 
and for extrachromosomal maintenance are know in the art, and any suitable vector may be used. 

20 Methods for introducing DNA into cells such as electroporation, calcium phosphate co- 
precipitation and viral transduction are known in the art, and the choice of method is within tlie 
comp^ence of the routineer. C^ells transformed with the wild-type llcby gene can be used as 
model systems to study a Disease and drug treatments to treat the Disease. 

As generally discussed above, the llcby gene or fragment where applicable, may be 

25 employed in gene therapy methods in order to increase the amount of the expression products of 
such gene in cells in need thereof. Such gene therapy is particularly appropriate for use in cells 
in which wild type llcby is under-expressed, and in which the level of llcby polypeptide is 
absent or diminished compared to normal cells. It may also be useful to increase the level of 
repression of a given llcby gene even in cells in which the mutant gene is expressed at a 

30 "normal" level, but the gene product is not fully functional. 

Gene therapy would be carried out according to generally accepted methods, for example, 
as described by Friedman, T., In Therapy for Genetic Diseases, T. Frideman, ed., Oxford 
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University Press, pp. 105-121, 1991. A virus or plasmid vector (see below), containing a copy 
of the llcby gene linked to expression control elements and cs^able of replicating inside the cells, 
is prepared. Suitable vectors are known, such as disclosed in U.S. Pat. No. 5,252,479 and PCT 
published application WO 93/07282. The vector is then injected in the patient, either locally at 
5 the site of the cell, tissue or organ, in need thereof, or systemically. If the transfected gene is not 
permanently incorporated into the genome of each of the targeted cells, the treatment may have to 
be repeated periodically. 

Gene transfer systems known in the art may be useful in the practice of the gene ther^y 
methods of the present invention. These include viral and nonviral transfer methods. A number 

10 of viruses have bem used as gene transfer vectors, includmg papovariuses, e.g., SV40 (Madzak 
et al., I. Gen. Virol, 73:1533-1536, 1992), adenovirus (Berkner, Curr. Top. Microbiol. 
Immunol., 158:39-61, 1992; Bemker et al.. Bio Tedmiques 6:616-629, 1988; Gorziglia and 
Kapikian, J. Virol. 66:4407-4412, 1992; Quantm et al., Proc. Natl. Acad. Sci. USA 89:2851- 
2584, 1992; Rosenfdd et al., Cell 68:143-155, 1992; Wilkmson et al.. Nucleic Acids Res. 

15 20:2233-2239, 1992; Stratford-Perricaudet et al.. Hum. Gene Ther. 1:241-256, 1990), vaccinia 
virus (Moss, Cun. Top. Microbiol. Immunol. 158:25-38, 1992) adeno-associated virus 
(Muzyczka, Curr. Top. Microbiol. Immunol. 158:97-123, 1992; Ohi et al.. Gene 89:279-282, 
1990), h^es viruses mcluding HSV and EBV (Margolskee, Curr. Top. Microbiol. Immunol. 
158:67-90, 1992; Johnson et al, J. Virol. 66: 2952-2965, 1992; Fink et al., Hum. Gene Hier. 

20 3:11-19, 1992; Breakfield and Geller, Mol. Neurobiol. 1:337-371, 1987; Freese et al., Biochem. 
Pharmacol. 40:2189-2199, 1990), and retroviruses of avian (Brandyopadhyay and Temm, Mol, 
Cell. Biol. 4:749-754, 1984; Petropoulos et al., J. Virol. 66:3391-3397, 1992), murine (Miller, 
Curr. Top. Microbiol. Immunol. 158:1-24, 1992; Miller et al., Mol. CeU. Biol. 5:431-437, 1985; 
Sorge et al., MoL Cdl. BioL 4:1730-1737, 1984; Mann and Baltimore, J. Virol. 54:401^7, 

25 1985; MUler et al., J. Virol. 62:4337-4345, 1988, and human origm (Shimada et al., J. Qin. 
Invest. 88:1043-1047, 1991; Helseth et al., J. Virol. 64:2416-2420, 1990; Page et al., J. Virol. 
64:5370-5276, 1990; Buchschacher and Panganiban, J. Vkol. 66:2731-2739, 1992). 

Nonviral gene transfer methods known in the art include chemical techniques such as 
calcium phosphate coprecipitation (Graham and van der EB., Vkology 52: 456-467, 1973; 

30 Pdlicer et al.. Science 209:1414-1422, 1980); mechanical techniques, for example microinjection 
HAndarson et al., Proc. Nafl. Acad, Sci. USA 77:5399-5403, 1980; Gordon et al., Proc. Nati. 
Acad. Sd. USA 77:7380-7384, 1980; Brmster et al., Cell 27:223-231, 1981; Constantini and 
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Lacy, Nature 294:92-94, 1981); membrane fusion-mediated transfer via liposomes (Feigner et al., 
Proc, Nad. Acad. Sci. USA 84:7413-7417, 1987; Wang and Huang, Biochemistry 28:9508-9514, 
1989; Kaneda et al., J, Biol. Chem. 264:12126-12129, 1989; Stewart et al., Hum. Gene Ther. 
3:267-275, 1992; Nabel et al.. Science 249:1285-1288, 1990); and direct DNA uptake and 
5 receptor-mediated DNA transfer (Wolff et al.. Science 247:1465-1468, 1990; Wu et al., J. Biol. 
Chem. 266:14338-14342, 1991; Wagner et al., Proc. Natl. Acad. Sci. USA 87:3410-3414, 1990; 
Wagner et al., Proc. Natl. Acad. Sci. USA 88:4255-4259, 1991; Gotten et al., Proc. Natl. Acad, 
Sci. USA 87:4033-4037, 1990; Curiel et al., Proc. Natl. Acad. Sci. USA 88:8850-8854, 1991; 
Curiel et al., Hum. Gene Ther. 3:147-154, 1991). Viral-mediated gene transfer can be combined 

10 with direct in vivo gene transfer using liposome delivery, allowing one to direct flie viral vectors 
to the tumor cells, if a Disease is tumor, and not into the surrounding nondividing cells. 
Alternatively, the retroviral vector produced cell line can be injected into tumors (Culver et al., 
Science, 256:1550-1552, 1992). Injection of producer cells would then provide a continuous 
source of vector particles. 

15 In an approach which combines biological and physical gene transfer methods, plasmid 

DNA of any size is combined with a polylysine-conjugated antibody specific to the adenovirus 
hexon protein, and the resulting complex is bound to an adenovirus vector. The trimolecular 
comply is then used to infect cells. The adenovkus vector permits efficient bmding, 
internalization, and degradation of the endosome before the coupled DNA is damaged. 

20 Liposome/DNA complexes have been shown to be capable of mediating direct in vivo 

gene transfer. While in standard liposome preparations the gene transfer process is nonspecific, 
localized in vivo uptake and expression have been rq)orted in tumor deposits, for example, 
foUowmg direct m situ administration (Nable, Hum. Gene Ther. 3:399-410, 1992). 

Gene transfer techniques which target DNA directly to tissues in need thereof is 

25 preferred. Receptor-mediated gene transfer, for example, is accomplished by the conjugation of 
. DNA (usually in the form of covalently closed supercoiled plasmid) to a protein ligand via 
polylysine. Ligands are chosen on the basis of the presence of the corresponding ligand receptors 
on the cell surface of the target cell/tissue type. One appropriate receptor/ligand pair may 
mclude, for example, the estrogen receptor and its ligand, estrogen (and estrogen analogues). 

3 0 These ligand-DNA conjugates can be injected directly into the blood if desired and are directed to 
-&e target tissue where receptor binding and internalization of the DNA-protem complex occurs. 
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To overcome the problem of intracellular destruction of DNA, coinfection with adenovirus can be 
included to disrupt endosome function. 

Also important is the development of e^erimental models of a Disease. Such models can 
be used to screen for agents that alter the progression of a Disease and to study pathological 
5 course of a Disease. Once a mutation causing a Disease is identified, it is possible using genetic 
manipulation to develop transgenic model systems and/or whole cell systems containing a mutated 
Ucby gene or a portion thereof. The model systems can be used for screening drugs and 
evaluating the efficacy of drugs in treating a Disease. In addition, these model systems provide a 
tool for defining the underlying biochemistry of llcby polypeptide and its relationship to a 

10 Disease thereby providing a basis for rational drug design. 

One type of cell system which can be used in the present invention can be naturally 
derived. For this, blood samples from an affected individual are obtained and permanently 
transformed into a lymphoblastoid cell line using, for example, Epstein-Barr virus. Once 
established, such cell lines can be grown continuously in suspension cultures and can be used in a 

15 variety of in vitro experiments to study llcby expression and processing. Another cell line used 
in diese studies comprises skin fibroblasts, derived from patients. Further cell cultures can be 
isolated form the transgenic animals or prepared from established cell cultures using the same 
constructs with standard cell transfection techniques. 

A mutant llcby gene of the present invention can be cloned and placed in a cloning 

20 vector. Examples of clonmg vectors which can be used include, but are not limited to, 

^charonSS, cosmid, or yeast artificial chromosome. Heterologous promoter sequences can also 
be used to control expression of llcby coding sequences. The ability to regulate expression of 
the llcby gene in transgenic animals is believed to be useful in evaluating the roles of the 
different llcby gene products in a Disease. The ability to regulate expression of the llcby gene 

25 in cultured cells is believed to be useful in evaluating expression and processing of the different 
llcby gene products and may provide the basis for cell cultured drug screens. The 
metallothionine (MTT) promoter is well characterized, has been employed in transgenic animals, 
and its expression can be regulated through modulation of zinc and glucocorticoid hormone levels 
(Palmiter et al., Nature 300, 611-615, 1982). Other alternatives for use in controlling human 

30 1 Icby expression include, but are not limited to, the human S actin gene promoter (Ray et al., 
-^Genes and Development 5:2265-2273, 1991), the human platelet derived growth factor B 
(PDGFB) chain gene promoter (Sasahara et al.. Cell 64:217-227, 1991), the rat sodium channel 
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gene promoter (Maue et al, Neuron 4:223-231, 1990), the human copper-zinc superoxide 
dismutase gene promoter (Ceballos-Picot et al., Bram Res. 552:198-214, 1991), and promoters 
for members of the mammalian POU-domain regulatory gene family (Xi et al, Nature 340:35-42, 
1989). 

5 Large segments of human genomic DNA, when cloned into certain vectors, can be 

propagated as autonomously-replicating units in the yeast cell. Such vector-borne segments are 
referred to as yeast artificial chromosomes (YAC; Burke et al.. Science 236:806 1987). A human 
YAC library is conunercially available (Clonetech, Palo alto, Calif.). Other sources of YAC 
libraries can be obtained from a number of commercial or public sources including Research 
10 Genetics or genome Systems, UK Human Genome Resource Mapping Centre, libraries include 
the CEPH meaga YACs, the Id YACs and the ICRF YACs with an average insert size of 
250,000 base pairs (range of 180,000 to 500,000 base pairs). A YAC clone can be directly 
isolated by screening the library with the human llcby cDNA. Other t^es of large insert 
libraries are also available in different vectors for example PACs and BACs titiese have average 
15 insert sizes of 100-120 000 base pairs. These can be obtamed from commercial and public 
d^sitories including Research Genetics and Genome Systems. The inclusion of all of the 
essential gene regions in the clone can be confirmed by PCR analysis. 

The YAC-llcby clone is established in embryonic stem (ES) cells by selecting for 
neomycm resistance encoded by the YAC vector. ES cells bearing the YAC-llcby clone are used 
20 to produce transgenic mice by established methods. The YAC-llcby gene bearing a mutation can 
be produced through the generation of a YAC library using genomic DNA from a person affected 
by a coutation. The clone is identified and established in ES cells as described above. 

DNA molecules introduced into ES cells can also be integrated into die chromosome 
through the process of homologous recombination, described by Capecchi, MR in Science, 
25 244:1288-1292 (1989). 

The mutant llcby gene can then be transferred to a host nonhuman animal such as a 
mouse. As a result of the transfer, the resultant transgenic nonhuman animal will preferably 
egress one or more of the variant 1 Icby polypeptides. 

Additional methods for producing transgenic animals are well known in the art. 
30 Transgenic animals are used in the assessment of new therapeutic compositions, as 

^^cemplified by U.S. Patent 5,223,610. These animals are also used in the development of 
predictive animal models for human disease states, as exemplified in U.S. Patent 5,221,778. For 
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example, transgenic animals are developed for assessing Alzheimer's disease (U.S. Patent 
7,769,626), multi-drug resistance to anticancer agents (U.S. Patent 7,260,827), and carcinogenic 
substances (U.S. Patent 4,736,866). Therefore, the mutated genes of the present invention which 
are believed to cause a Disease in Chromosome 22ql3-linked pedigrees provide a useful means 
5 for developing transgenic animals to assess this Disease. 

Transgenic animal technology also offers a whole animal expression-cloning system in 
which introduced genes are expressed to give large amounts of lldjy polypeptide of the present 
invention. Thus in one aspect the invention relates to a transgenic animal comprising a llcby 
polynucleotide. 

1 0 Polynucleotide and polypeptide sequences fyrm a valuable information resource with which to 

identify further sequences of similar homology. This is most easily fiacilitated by storing the sequence 
in a computer readable medium and dien using the stored data to search a sequence database using 
wdl known searching tools, sudi as GCC. Accordingly, in a furflio: aspect, the present invention 
provides for a computer readable medium having stored thereon a polynucleotide comprising the 

15 sequence of SEQ ID N0:1 and/or a polypeptide sequence Micoded thereby. 

The following definitions are provided to facilitate understanding of certain terms used 
frequently hereinbefore. 

"Isolated** means altered "by the hand of man" from the natural state. If an "isolated" 
composition or substance occurs in nature, it has been changed or removed from its original 

20 envkonment, or both. For example, a polynucleotide or a polypeptide naturally present in a 
living animal is not "isolated," but the same polynucleotide or polypeptide separated from the 
coexisting materials of its natural state is "isolated", as the term is employed herein. 

"Polynucleotide" generally refers to any polyribonucleotide or polydeoxribonudeotide, 
which may be unmodified RNA or DNA or modified RNA or DNA. "Polynucleotides" include, 

25 without limitation, single- and double-stranded DNA, DNA that is a mixture of single- and 

double-stranded regions, single- and double-stranded RNA, and RNA that is mixture of single- 
and double-stranded regions, hybrid molecules comprising DNA and RNA that may be single- 
stranded or, more typically, double-stranded or a mixture of single- and double-stranded regions. 
In addition, "polynucleotide" refers to triple-stranded regions comprising RNA or DNA or both 

30 RNA and DNA. The term "polynucleotide" also includes DNAs or RNAs containing one or 
...jnore modified bases and DNAs or RNAs with backbones modified for stability or for other 
reasons. "Modified" bases include, for example, tritylated bases and unusual bases such as 



wo 00/37113 



PCT/US99y30622 



24 

inosine. A variety of modifications may be made to DNA and RNA; thus, **polynucleotide'* 
embraces chemically, enzymatically or metabolically modified forms of polynucleotides as 
typically found in nature, as well as the chemical forms of DNA and RNA characteristic of 
viruses and cells. "Polynucleotide*' also embraces relativdy short polynucleotides, often referred 
S to as oligonucleotides. 

"Polypeptide" refers to any peptide or protein comprising two or more amino acids joined 
to each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres. "Polyp^tide" 
refers to both short chains, commonly referred to as peptides, oligopeptides or oligomers, and to 
longer chains, generally ref^ed to as proteins. Polypeptides may contain amino acids other tiian 

10 the 20 gene-encoded amino acids. "Polypeptides" include amino acid sequences modified either 
by natural processes, such as post-translational processmg, or by chemical modification 
techniques which are well known in the art. Such modifications are well described in basic texts 
and in more detailed monographs, as well as in a voluminous researdi literature. Modifications 
may occur anjrwhere in a polypeptide, including the pq)tide backbone, the amino acid side-chains 

15 and the ammo or carbo^qrl termini. It will be appreciated that the same type of modification may 
be present to the same or varying degrees at several sites in a given polypeptide. Also, a givem 
polypeptide may contain many types of modifications. Polypeptides mtxy be branched as a result 
of ubiquitination, and fliey may be cyclic, with or witiiout brandimg. Cyclic, branched and 
branched cyclic polypeptides may result from post-translation natural processes or may be made 

20 by synthetic methods. Modifications include acetylation, acylation, ADP-ribosylation, amidation, 
covalent attachment of flavin, covalent attachment of a heme moiety, covalent attadiment of a 
nucleotide or nucleotide derivative, covalent attachment of a lipid or lipid derivative, covalent 
attachment of phosphotidylmositol, cross-linking, cyclization, disulfide bond formation, 
demethylation, formation of covalent cross-links, formation of cystine, formation of 

25 pyroglutamate, formylation, ganuna-carboxylation, glycosylation, GPI anchor formation, 
hydroxylation, iodiaation, methylation, myristoylation, oxidation, proteolytic processing, 
phosphorylation, prenylation, racemization, selenoylation, sulfation, transfer-RNA mediated 
addition of amino acids to proteins such as arginylation, and ubiquitination (see, for instance, 
PROTEINS-STRUCnTJRE AND MOLECULAR PROPERTIES, 2nd Ed., T. E. Creighton, 

30 W. H. Freeman and Company, New York, 1993; Wold, F., Post-translational Protein 

-J^odifications: Perspectives and Prospects, pgs. 1-12 in POSTTRANSLATIONAL COVALENT 
MODMCATION OF PROTEINS, B. C. Johnson, Ed., Academic Press, New York, 1983; 
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Seifito et a/., ** Analysis for protein modifications and nonprotein cofactors", Meth Enzymol 
(1990) 182:626-646 and Rattan et al, "Protein Synthesis: Post-translational Modifications and 
Aging**, Am NY Acad Sci (1992) 663:48-62). 

"Variant" refers to a polynucleotide or polypeptide that differs from a reference 
5 polynucleotide or polypeptide, but retains essential properties. A typical variant of a 

polynucleotide differs in nucleotide sequence from another, reference polynucleotide. Changes in 
the nucleotide sequmce of the variant may or may not alter the amino acid sequence of a 
polyp^tide encoded by the reference polynucleotide. Nucleotide changes may result in amino 
acid substitutions, additions, deletions, fusions and truncations or chromosomal translocations in 

10 the polypeptide encoded by the reference sequence, as discussed below. A typical variant of a 
polypeptide difBers in amino acid sequence from another, referrace polypeptide. Generally, 
differences are limited so that the sequences of the reference polypeptide and the variant are 
closely similar overall and, in many regions, identical. A variant and reference polypq>tide may 
diffi^ in amino acid sequence by one or more substitutions, additions, deletions in any 

15 combination. A substituted or inserted amino acid residue may or may not be one encoded by the 
genetic code. A variant of a polynucleotide or polypeptide may be a naturally occurring such as 
an allelic variant, or it may be a variant that is not known to occur naturally. Non-naturally 
occurring variants of polynucleotides and polypeptides may be made by mutagenesis techniques or 
by direct synthesis. 

20 "Identity," as known in the art, is a relationship between two or more polypeptide 

sequences or two or more polynucleotide sequences, as determined by comparing the sequences. 
In the art, "identity" also means the degree of sequence relatedness between polypeptide or 
polynucleotide sequences, as the case may be, as determined by the match between strings of such 
sequences. "Identity" and "similarity" can be readily calculated by known methods, including but 

25 not limited to those described m (Computational Molecular Biology y Lesk, A.M., ed., Oxford 
University Press, New York, 1988; Biocomputing: Informatics and Genome Projects, Smith, 
D.W., ed., Acadanic Press, New York, 1993; Computer Analysis of Sequence DatOy Part I, 
Griffin, A.M., and Griffin, H.G., eds., Humana Press, New Jersey, 1994; Sequence Analysis in 
Molecular Biology^ von Heinje, G., Academic Press, 1987; and Sequence Analysis Primer ^ 

30 Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991; and (Zarillo, H., and 
-"^Lipman, D., SIAM J. Applied MatK , 48: 1073 (1988). Preferred methods to determine identity 
are designed to give tiie largest match between the sequences tested. Methods to determine 
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identity and similarity are codified in publicly available computer programs. Preferred computer 
program methods to determine identity and similarity between two sequences include, but are not 
limited to, the GCG program package Pevereux, J., et al., Nucleic Adds Research 12(1): 387 
(1984)), BLAST?, BLASTN, and FASTA (Atschul, S.F. et al., /. Molec. Biol 215: 403-410 
5 (1990). The BLAST X program is publicly available iirom NC3I and other sources {BLAST 
Manual, Altschul, S., et al, NCBI NLM NIH Bethesda, MD 20894; Altschul, S., et al.,J. Mol 
Biol 215: 403-410 (1990). The well known Smith Waterman algorithm may also be used to 
determine identity. 

Preferred parameters for polypeptide sequence comparison include Ae foUowmg: 
10 1) Algorithm: Needleman and Wunsch, J. Mol Biol. 48: 443-453 (1970) 

Comparison matrix: BLOSSUM62 from Hentikoff and Hentikoff, Proc. Natl, Acad. Sci. USA, 
89:10915-10919 (1992) 
Gap Penalty: 12 
Gzp Length Penalty: 4 

15 A program useful with these paramet^s is publicly available as tiie "gap" program from 

Genetics Computer Group, Madison WI. The aforementioned parameters are the default 
parameters for peptide comparisons (along with no penalty for end gaps). 

Preferred parameters for polynucleotide comparison mclude the following: 
1) Algorithm: Needleman and Wunsch, J. Mol Biol. 48: 443-453 (1970) 

20 Comparison matrix: matches + 10, mismatch = 0 
Gap Penalty: 50 
Gap Length Penalty: 3 

Available as: The *'gap" program from Genetics Computer Group, Madison WI. These are Ae 
default parameters for nucleic acid comparisons. 

25 By way of example, a polynucleotide sequence of the present invention may be id^tical 

. to the refi^ence sequence of 5EQ ID N0:1, that is be 100% identical, or it may include up to a 
certain integer number of nucleotide alterations as compared to the reference sequence. Such 
alterations are selected from the group consisting of at least one nucleotide deletion, substitution, 
induding transition and transversion, or msmion, and wherein said alterations may occur at the 

30 5' or 3* terminal positions of the reference nucleotide sequence or anywhere between those 
—terminal positions, interspersed either individually among the nucleotides in the ttfeim^t 
sequence or in one or more contiguous groups withm the reference sequence. Tlie number of 



wo 00/37113 



PCT/US99/30622 



nucleotide alterations is determined by multiplying the total number of nucleotides in SEQ ID 
NO: 1 by the numerical percent of the respective percent identily(divided by 100) and subtractmg 
that product from said total number of nucleotides in SEQ ED N0:1, or: 

nn^Xn-(xn-y), 

5 wherein Uj^ is the number of nucleotide alterations, Xji is the total number of nucleotides m SEQ 
ID N0:1, and y is, for instance, 0.70 for 70%, 0.80 for 80%, 0.85 for 85%, 0.90 for 90%, 0.95 
for 95%, etc., and wherem any non-integer product of and y is rounded down to the nearest 
mteger prior to subtracting it from Xq. Alterations of a polynucleotide sequence encoding the 
polypeptide of SEQ ID N0:2 may create nonsense, missense or frameshift mutations in this 

10 coding sequence and thereby alter the polypeptide encoded by the polynucleotide following sudi 
alterations. 

"Fusion protein" refers to a protein encoded by two, often unrelated, fused genes or 
fragments thereof. In one example, EP-A-0 464 discloses fusion proteins comprising various 
portions of constant region of immunoglobulm molecules together wfth another human protein or 
15 part thereof. In many cases, employing an inmiunoglobulin Fc region as a part of a fusion 
protein is advantageous for use in therapy and diagnosis resulting in, for example, improved 
pharmacokinetic properties [see, e.g., EP-A 0232 262]. On the other hand, for some uses it 
would be desirable to be able to delete the Fc part after the fusion protein has been expressed, 
detected and purified. 

20 All publications, including but not limited to patents and patent J5)plications, cited in this 

specification are herein incorporated by reference as if each individual publication were 
specifically and individually indicated to be incorporated by reference herein as though fiilly set 
forth 
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What is claimed is: 

1 . An isolated polynucleotide which comprises a polynucleotide which has at least: 

(a) 70% identity; 

(b) 80% identity; 

(c) 90% identity; 

(d) 95% identity; 

(e) 97% idoitity; or 

(f) 98% idootity 

to fliat of SEQ ID NO: 1 over the entire length of SEQ ID NO: 1. 

2. An isolated polynucleotide which comprises the polynucleotide of SEQ ID NO: 1. 

3. An isolated polynucleotide of SEQ ID NO: 1. 



15 4. An isolated polynucleotide which has at least 

(a) 70% identity; 

(b) 80% identity; 

(c) 90% identity; 

(d) 95% identity; 
20 (e) 97% identity; or 

(f) 98% identityto a polynucleotide selected from tiie group consistmg of 

a. SEQ ID N0:3; 

b. SEQIDN0:5; and 

c. SEQ ID N0:7; 

25 over the entire length of SEQ ID N0S:3, 5 and 7, respectively. 

5. An isolated polynucleotide comprising a polynucleotide which has at least 

(a) 70% identity; 

(b) 80% identity; 
30 (c) 90% idmtity; 

(d) 95% id^ty; 
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(e) 97% idfflitity; or 

(f) 98% identity 

to a polynucleotide selected from the group consisting of 
a. SEQ ID N0:4; and 
5 b. SEQ ID N0:6. 

over the entire length of SEQ ID N0:4 and 6, respectivdy . 

6. A polynucleotide which is complementary to any of the polynucleotides of claims 1 to 5. 

10 7. A method of detecting a presence of or an absense of variations in a 1 Icby allde in an 
individual from wild-type llcby gene sequence, comprising comparing the llcby polynucleotide 
sequences of the individual from that of wild-^e llcby gene sequraice. 

8. A method of detecting a presence of or an absense of variation in a 1 Icby allele in an 
15 individual from that of SEQ ID NO: 1 , comprising comparing a llcby allele sequence of the 

individual with that of SEQ ID NO: 1. 

9. A method of screening an individual for an increased risk of developing a Disease comprising 
the steps oft 

20 (a) assaying for a presence or absence of a 1 Icby polynucleotide mutation rdative to 1 Icby 

polynucleotide in a normal individual; and 

(b) correlating the presence of a mutation to an increased risk of developing a Disease, and 
absence of mutation to no increased risk of developmg a Disease. 

25 10. A method for diagnosing a Disease in an individual comprising: 

(a) assaying for a presence or absence of a 1 Icby polynucleotide mutation relative to 1 ld)y 
polynucleotide in a normal individual; and 

(b) correlating the presOTce of a mutation to a presoice of a Disease, and absence of mutation 
to an absence of a Disease. 



30 



••11 . A method of claim 7, 8, 9, or 10 whidi further conq)rises isolating DNA from said 
individual. 
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12. A method of performing genetic association studies for searching a disease susceptibility gaie 
or individual's likelihood of responding to a particular therapeutic intervention, comprising using 
polymorphic markers in the polynucleotide of SEQ ID NO: 1 and associating with the disease 
susceptibility gene or with likelihood of responding to a particular tiierapeutic intervention to the 

5 polymorphic markers. 

13. A method genetic counselmg comprising the steps of: 

(a) isolating genomic DNA from a human individual; 

(b) assaying the DNA for a mutation relative to Ucby polynucleotide of a normal human 

10 subject; 

(c) determining a potential for developing a Disease in the human individual, wherein the 
presence of the mutation in the genomic DNA of the human individual is indicative of an 
increased potential for developing the Disease; and 

(d) advising the human individual with respect to the individual's potential for developing 
15 the Disease. 

14. An expression system comprising a polynucleotide of daim 1 capable of producing a 
polypeptide when said expression system is present in a compatible host cdl. 

20 15. A process for producing a recombinant host cell comprising transforming or transfecting 
a cell with the expression system of claim 14 such the the host cell, under appropriate culture 
conditions, produces a polypeptide. 

16. A recombinant host cell produced by the process of claun 15, 

25 

17. A process for producing a polypeptide comprising culturing a host cell of claim 16 under 
conditions sufficient for the production of said polypeptide and recovermg the polypeptide from 
the culture. 

30 18. A method of providing an individual in need of Ucby polypeptide comprising providing a 
Icby polynucleotide through a gene th^apy . 
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19. A transgenic animal comprising a 1 Icby polynucleotide or a mutant thereof. 

20. A method for screening compounds effective for treating a Disease comprising: 
administering the compound to be tested to a transgenic animal comprising a mutated llcby 
polynucleotide and assessing whether the compound is effective to treat the Disease. 

5 

21 . An isolated polynucleotide comprising a polynucleotide which has at least: ' 

(a) 70% identity; 

(b) 80% identity; 

(c) 90% identity; 
10 (d) 95% identity; 

(e) 97% identity; or 

(f) 98% id^tity 

to a gDNA contained in ATCC Deposit number 98964 over the entire length gDNA. 
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SEQ ID N0:1 (Genomic Sequence) 

1 GAGGTCCTGG CTATTGCCAG CATGGAGTGA CCTGTGTCAC CTCTGAGTGC 

51 CAGGCAAGGG TTCAGCAGCT GACGACTCAG CTTCTGCAGG ATGCTGGCAG 

101 CATAGCCAGC GAGATAGTTG GAAGCCGTCA GGGCACAGGG AAGGG6CC6A 

151 GGGTGCCCTG AGTGTGCATG GGGGGCAGCC CTGCTGCAGT CCAAGCCTTT 

201 GATTCCCAAG CTATGTGCAC AGTTTCCTCT GGACTCTGCC ATGTGGCCCA 

251 GCCACCCATA CCTGGAATAG GGGCTAAGCC AAGCTGCTCT CTCCTCCAAA 

301 GGGAGGCAGC CTGTGTGCTT TGTCCGrXTG CCTTTGCAGA GACCTCGATC 

351 CTCACGCAAG GCAAGCAGCA GCCCCTGTAA GCACACGAGA CAATCCCAAG 

401 TGTCAGTGGG AAGGAGATCC CTTTCCTGAT GGGGCTGCCT GTGTCCAGTC 

45 1 CCTCCCAGCT TCCCCAGGGC CCTGGGGCTC TGCAGGCATT CAGAAGTGGA 

501 AGCCAGCCAC AGCCTGGGAC TGAAGAGGTT AATGTGCATC TGCCTCCGAA 

551 TGTTAATGTG TCTAGGTGAT GTCAGTGGGA GCCATGAAGA AGGGAGT6G6 

601 GAGGGCAGTT GGGCTTGGAG GCGGCAGCGG CTGCCAGGCT ACGGAGGAAG 

651 ACCCCCTTCC CGACTGCGGG GCTTGCGCTC CGGGACAAGG TGGCAGGCGC 

701 TGGAGGCTGC CGCAGCCTGC GTGG6TGGAG GGGA6CTCAG CTCGGTT6TG 

751 GGAGCA6GCG ACC6GCACTG GCTGGAT6GA CCTG6AA6CC TCGCTGCTGC 

801 CCACTGGTCC CAATGCCAGC AACACCTCTG ATGGCCCCGA TAACCTCACT 

851 TCGGCAGGTG AGTTGACTGG GAGCCCTCCC TCCTCTGGGC TGTGGGTGGA 

901 AAATGGGAAG GTTTCACCCC TGAGCCAAAC TGCTTGGGAA ACTTTATCAC 

951 AGTTCTTGGG GACAAGATCT GTGGTCTGCT TTGCTCTGAG GGGCAGGAGA 

1001 AAAGGGGGCA ATGGTCCGCA GGGGCAGACG GGCAGGAGCA GAGCAGGGGG 

1051 CGAAGGCATA TTCAGAATGG CAAGGAAGGG GGGCCAGCCG TGAGACAGCA 

1101 GGGGAAGGCT CGCTGCTGGG TTCCAAAGAT GCTTGGCAGA AAAAATTCCA 

1151 GGCTGGAAAA GCAAGCGAGA GAAGCTGGAG GGTGGTATGT GGGAGACAGC 

1201 TGGGGGCTCA CTGCTGCAGT GTTAGCCTCA GCTTTTTACT CCCACTTGGA 

1251 TGATGAGGTC TGAGACATCC TTACTGCCAC CTGGGAGAGG CCCTGGGAAG 

1301 GGAAGACTTC ACAGAGCCAT GAGGGGATTA ACTTTTCTGG TGAATTAAGC 

1351 TTCCTGACAT TTCCAGAGCT GCGGTGCCCT GGGATTCCAG CTTTGAAGGA 

1401 6AAAG6AAGG AAGGAAAAGA GGAAAGGCTT ATGTAGATAA TTTTTCCAG6 

1451 CTGCTGAGCT CCAACAGACA GTTTCTGTCT CTGCTTCACT CAAGAAGCCC 

1501 AGGCTCAGAA GATACCAATC AAG6AAATCC CCGCTAGGAA GCCT66GGTA 

1551 GGGAGAGCTG CTGGCTTGAC CAGGGCACAG CCGGCAAAAG CCTCTACAAG 

1601 ACAGTCACCC ACAGATATGC CCAAGAATCA GTACACAGTT TCCAACCA6A 

1651 GATCTCCAAA ATGAAACACT CAGG6CTACA CATA6GAAAA GCACGCACAC 

1701 ACACACACAC ACACACACAG ACACTTACTT TTGTGTCCTT CTGGCTATGC 

1751 TGACGAGTTT TCCTGGTGAA GCCCGGGGCT CACAGAGTAA TCTCTGCAGA 

1801 CAACTGTGGT TCTTGCCTCT GGTGCCTGCA GGAGGCAGGC ATGTTGTGTC 

1851 CTTCCAAGAC AGATGGCTCA GGGCACTCTG GTAGGATTCA CCAGGAAACT 

1901 CATGGAGAAG GGAAAAGGGA CAAGATTAGC AACAGTGAAG GGAGGGAGAA 

1951 TGGTGGGAGA GGATTGGAGA TGAACGGTGG GTCGCTGGAG GCTGAGCATG 

2001 CCAGCAGGAT GTCAGTTCTC AGAGCAAAGC CCATGTCAAA CAGCCAACGC 

2051 TTGCTCCTTC TGTCCCCAGG ATCACCTCCT CGCACGGGGA GCATCTCCTA 

2101 CATCAACATC ATCATGCCTT CGGTGTTCGG CACCATCTGC CTCCTGGGCA 
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2151 TCATCGGGAA CTCCACGGTC ATCTTCGCGG TCGTGAAGAA GTCCAAGCTG 

2201 CACTGGTGCA ACAACGTCCC CGACATCTTC ATCATCAACC TCTCGGTAGT 

2251 AGATCTCCTC TTTCTCCTGG GCATGCCCTT CATGATCCAC CAGCTCATGG 

2301 GGAATGGGGT GTGGCACTTT GGGGAGACCA TGTGCACCCT CATCACGGCC 

2351 ATGGATGCCA ATAGTCAGTT CACCAGCACC TACATCCTGA CCGCCATGGC 

2401 CATTGACCGC TACCTGGCCA CTGTCCACCC CATCTCTTCC ACGAAGTTCC 

2451 GGAAGCCCTC TGTGGCCACC CTGGTGATCT GCCTCCTGTG GGCCCTCTCC 

2501 TTCATCAGCA TCACCCCTGT GTGGCTGTAT GCCAGACTCA TCCCCTTCCC 

2551 AGGAGGTGCA GTGGGCTGCG GCATACGCCT GCCCAACCCA GACACTGACC 

2601 TCTACTGGTT CACCCTGTAC CAGTTTTTCC TGGCCTTTGC CCTGCCTTTT 

2651 GTGGTCATCA CAGCCGCATA CGTGAGGATC CTGCAGCGCA TGACGTCCTC 

2701 AGTGGCCCCC GCCTCCCAGC GCAGCATCCG GCTGCGGACA AAGAGGGTGA 

2751 CCCGCACAGC CATCGCCATC TGTCTGGTCT TCTTTGTGTG CTGGGCACCC 

2801 TACTATGTGC TACAGCTGAC CCAGTTGTCC ATCAGCCGCC CGACCCTCAC 

2851 CTTTGTCTAC TTATACAATG CGGCCATCAG CTTGGGCTAT GCCAACAGCT 

2901 GCCTCAACCC CTTTGTGTAC ATCGTGCTCT GTGAGACGTT CCGCAAACGC 

2951 TTGGTCCTGT CGGTGAAGCC TGCAGCCCAG GGGCAGCTTC GCGCTGTCAG 

3001 CAACGCTCAG AGGGCTGAGG AGGAGAGGAC AGAAAGCAAA GGCACCTGAT 

3051 ACTTCCCCTG CCACCCTGCA CACCTCCAAG TCAGGGCACC ACAACACGCC 

3101 ACCGGGAGAG ATGCTGAGAA AAACCCAAGA CCGCTCGGGA AATGCAGGAA 

3151 GGCCGGGTTG TGAGGGGTTG TTGCAATGAA ATAAATACAT TCCATGGGGC 

3201 TCACACGTTG CTGGGGAGGC CTGGAGTCAG GTTTGGGGTT TTCAGATATC 

3251 AGAAATCCCC TTGGGGGAGC AGGATGAGAC CTTTGGATAG AACAGAAGCT 

3301 GAGCAAGAGA ACATGTTGGT TTGGATAACC GGTTGCACTA TATCTGTGAG 

3351 CTCTCAAATG TCTTCTTCCC AAGGCAAGAG GTGGAAGGGT ACTGACTGGG 

3401 TTTGTTTAAA GTCAGGCAGG GCTGGAGTGA GCAGCCAGGG CCATGTTGCA 

3451 CAAGGCCTGA GAGACGGGAA AGGGCCCGAT CGCTCTTT 

SEQ ID NO: 2 (Amino Acid sequence of llcby protein) 

1 MDLEASLLPT GPNASNTSDG PDNLTSAGSP PRTGSISYIN IIMPSVFGTI 

51 CLLGIIGNST VIFAWKKSK LHWCNNVPDI FIINLSWDL LFLLGMPFMI 

101 HQLMGNGVWH FGETMCTLIT AMDANSQFTS TYILTAMAID RYLATVHPIS 

151 STKFRKPSVA TLVICLLWAL SFISITPVWL YARLIPFPGG AVGCGIRLPN 

201 PDTDLYWFTL YQFFLAFALP FWITAAYVR ILQRMTSSVA PASQRSIRLR 

251 TKRVTRTAIA ICLVFFVCWA PYYVLQLTQL SISRPTLTFV YLYNAAISLG 

301 YANSCLNPFV YIVLCETFRK RLVLSVKPAA QGQLRAVSNA QTADEERTES 

351 KGT 

Fig. IB 
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SEQ ID N0:3 (5 '-Upstream Sequence of the coding region) 

1 GAGGTCCTGG CTATTGCCAG CATGGAGTGA CCTGTGTCAC CTCTGAGTGC 

51 CAGGCAAGGG TTCAGCAGCT GACGACTCAG CTTCTGCAGG ATGCTGGCAG 

101 CATAGCCAGC GAGATAGTTG GAAGCCGTCA GGGCACAGGG AAGGGGCCGA 

151 GGGTGCCGTG AGTGTGCATG GGGGGCAGCC CTGCTGCAGT CCAAGCCTTT 

201 GATTCCCAAG CTATGTGCAC AGTTTCCTCT GGACTCTGCC ATGTGGCCCA 

251 GCCACCCATA CCTGGAATAG GGGCTAAGCC AAGCTGCTCT CTCCTCCAAA 

301 GGGAGGCAGC CTGTGTGCTT TGTCCGTTTG CCTTTGCAGA GACCTCGATC 

351 CTCACGCAAG GCAAGCAGCA GCCCCTGTAA GCACACGAGA CAATCCCAAG 

401 TGTCAGTGGG AAGGAGATCC CTTTCCTGAT GGGGCTGCCT GTGTCCAGTC 

451 CCTCCCAGCT TCCCCAGGGC CCTGGGGCTC TGCAGGCATT CAGAAGTGGA 

501 AGCCAGCCAC AGCCTGGGAC TGAAGAGGTT AATGTGCATC TGCCTCCGAA 

551 TGTTAATGTG TCTAGGTGAT GTCAGTGGGA GCCATGAAGA AGGGAGTGGG 

601 GAGGGCAGTT GGGCTTGGAG GCGGCAGCGG CTGCCAGGCT ACGGAGGAAG 

651 ACCCCCTTCC CGACTGCGGG GCTTGCGCTC CGGGACAAGG TGGCAGGCGC 

701 TGGA66CTGC CGCAGCCTGC GTGGGTGGAG GGGAGCTCAG CTCGGTTGTG 

751 6GAGCAGGCG ACCGGCACTG GCTGG 

SEQ ID NO: 4 (First coding sequence) 

1 ATGGACCTGG AAGCCTCGCT GCTGCCCACT GGTCCCAATG CCAGCAACAC 

51 CTCTGATGGC CCCGATAACC TCACTTCGGC AG 

SEQ ID N0:5 (Intron within the coding region) 

1 GTGAGTTGAC rGGGAGCCCT CCCTCCTCTG GGCTGTGGGT GGAAAATGGG 

51 AAGGTTTCAC CCCTGAGCCA AACTGCTTGG GAAACTTTAT CACAGTTCTT 

101 GG6GACAAGA TCTGTGGTCT GCTTTGCTCT GAGGGGCAGG AGAAAAGGGG 

151 GCAATGGTCC GCAGGGGCAG AGGGGCAGGA GCAGAGCAGG GGGCGAAG6C 

201 ATATTCAGAA TGGCAAGGAA GGGGGGCCAG CCGTGAGACA GCAGGGGAAG 

251 GCTCGCTGCT GGGTTCCAAA GATGCTTGGC AGAAAAAATT CCAG6CTGGA 

301 AAAGCAAGCG AGAGAAGCTG GAGGGTGGTA T6TGGGAGAC AGCTG6GGGC 

351 TCACTCCTGC ACTGTTAGCC TCAGCTTTTT ACTCCCACTT GGATGATGAG 

4.01 GTCTGAGACA TCCTTACTGC CACCTGGGAG AGGCCCTGGG AAGGGAAGAC 

451 TTCACAGAGC CATGAGGGGA TTAACTTTTC TGGTGAATTA AGCTTCCTGA 
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501 CATTTCCAGA GCTGCGGTGC CCTGGGATTC CAGCTTTGAA GGAGAAAGGA 

551 AGGAAG6AAA A6AGGAAAGG CTTATGTAGA TAATTTTTCC AG6CTGCTGA 

601 GCTCCAACAG ACAGTTrCTG TCTCTGCTTC ACTCAAGAAG CCCAGGCTCA 

651 GAAGATACCA ATCAAGGAAA TCCCCGCTAG GAAGCCTGGG GTAGGGAGAG 

701 CTGCTGGCTT GACCAGGGCA CAGCCGGCAA AA6CCTCTAC AAGACA6TCA 

751 CCCACAGATA TGCCCAAGAA TCAGTACACA GTTTCCAACC AGAGATCTCC 

801 AAAAT6AAAC ACTCAG6GCT ACACATAGGA AAAGCACGCA CACACACACA 

851 CACACACACA CAGACACTTA CTTTTGTGTC CTTCTGGCTA TGCTGACGAG 

901 TTTTCCT6GT 6AA6CCCGGG GCTCACAGAG TAATCTCTGC AGACAACTGT 

951 GGTTCTTGCC TCTGGTGCCT GCAGGAGGCA GGCATGTTGT GTCCTTCCAA 

1001 GACAGAT6GC TCA6GGCACT CTGGTAGGAT TCACCAGGAA ACTCAT66AG 

1051 AAGGGAAAAG GGACAAGATT AGCAACAGTG AAGGGAGGGA GAATGGTGGG 

1101 AGAGGATTCC AGATGAACGG TGGGTCGCTG GAGGCTGA6C ATGCCAGCA6 

1151 GATGTCAGTT CTCAGAGCAA AGCCCATGTC AAACAGCCAA CGCTTGCTCC 

1201 TTCTGTCCCC AG 

SEQ ID NO: 6 (Second coding sequence) 

1 GATCACCTCC TCGCACGGGG AGCATCTCCT ACATCAACAT CATCATGCCT 

51 TCGGTGTTCG GCACCATCTG CCTCCTGGGC ATGATCGGGA ACTCCACGGT 

101 CATCTTCGCG GTC6TGAA6A AGTCCAAGCT GCACT6GTGC AACAACGTCC 

151 CCGACATCTT CATCATCAAC CTCTCGGTAG TAGATCTCCT CTTTCTCCTG 

201 GGCATGCCCT TCATGATCCA CCAGCTCATG GGCAATGGGG TGTGGCACTT 

251 TGGGGAGACC ATGTGCACCC TCATCACGGC CATGGATGCC AATAGTCAGT 

301 TCACCAGCAC CTACATCCTG ACCGCCATGG CCATTGACCG CTACCT6GCC 

Fig. ID 
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351 ACTGTCCACC CCATCTCTTC CACGAAGTTC CGGAAGCCCT CTGTGGCCAC 

401 CCTGGTGATC TGCCTCCTGT GGGCCCTCTC CTTCATCAGC ATCACCCCTG 

451 TGTGGCTGTA TGCCAGACTC ATCCCCTTCC CAGGAGGTGC AGTGGGCTGC 

501 GGCAXACGCC T6CCCAACCC AGACACTGAC CTCTACTGGT TCACCCTGTA 

551 CCAGTTTTTC CTGGCCTTTG CCCTGCCTTT TGTGGTCATC ACAGCCGCAT 

601 ACGTGAGGAT CCTGCAGCGC ATGACGTCCT CAGTGGCCCC CGCCTCCCAG 

651 CGCAGCATCC G6CTGCGGAC AAAGA6GGT6 ACCCGCACA6 CCATCGCCAT 

701 CTGTCTGGTC TTCTTTGTGT GCTGGGCACC CTACTATGTG CTACAGCTGA 

751 CCCAGTTGTC CATCAGCCGC CCGACCCTCA CCTTTGTCTA CTTATACAAT 

801 GCGGCCATCA GCTTGGGCTA TGGCAACAGC TGCCTCAACC CCTTTGTGTA 

851 CATCGTGCTC TGTGAGACGT TCCGCAAACG CTTGGTCCTG TCGGTGMGC 

901 CTGCAGCCCA GGGGCAGCTT CGCGCTGTCA GCAACGCTCA GACGGCTGAC 

951 GA6GAGAGGA GAGAAA6CAA A6GCACCT6A 

S£Q ID NO: 7 (3 '-sequence afcer the coding sequence) 

1 TACTTCCCCT GCCACCCTGC ACACCTCCAA GTCAGGGCAC CACAACACGC 

51 CACCGGGAGA GATGCTGAGA AAAACCCAAG ACC6CTCGGG AAATGCAG6A 

101 AG6CCG6GTT GTGAGG6GTT GTTGCAATGA AATAAATACA TTCCATG66G 

151 CTCACACGTT GCTGGGGAGG CCTGGAGTCA GGTTTGGGGT TTTCAGATAT 

201 CAGAAATCCC CTTGG6GGAG CAGGATGAGA CCTTTGGATA GAACAGAAGC 

251 TGAGCAAGAG AACATGTTGG TTTGGATAAC CGGTT6CACT ATATCTGTGA 

301 GCTCTCAAAT GTCTTCTTCC CAAGGCAAGA GGTGGAAGGG TACTGACTGG 

~. 351 GTTTGTTTAA AGTCAGGCAG GGCTGGAGTG AGGAGCCAGG GCCATGTTGC 

401 ACAAGGCCTG AGAGAC6GGA AAGGGCCC6A TCGCTCTTT 
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SEQUENCE LISTING 
<110> SmithKline Beecham Corporation 
<120> llcby Genomic Sequence 

<130> GP-50010WO 

<140> To Be Assigned 
<141> 1999-12-22 

<150> 09/218,467 
<151> 1998-12-22 

<160> 7 

<170> FastSEQ for Windows Version 3.0 

<210> 1 

<211> 3488 

<212> DNA 

<213> HOMO SAPIENS 

<400> 1 

gaggtcctgg ctattgccag catggagtga cctgtgtcac ctctgagtgc caggcaaggg 60 

ttcagcagct gacgactcag cttctgcagg atgctggcag catagccagc gagatagttg 120 

gaagccgtca gggcacaggg aaggggccga gggtgccctg agtgtgcatg gggggcagcc 180 

ctgctgcagt ccaagccttt gattcccaag ctatgtgcac agtttcctct ggactctgcc 240 

atgtggccca gccacccata cctggaatag gggctaagcc aagctgctct ctcctccaaa 300 

gggaggcagc ctgtgtgctt tgtccgtttg cctttgcaga gacctcgatc ctcacgcaag 360 

gcaagcagca gcccctgtaa gcacacgaga caatcccaag tgtcagtggg aaggagatcc 420 

ctttcctgat ggggctgcct gtgtccagtc cctcccagct tccccagggc cctggggctc 480 

tgcaggcatt cagaagtgga agccagccac agcctgggac tgaagaggtt aatgtgcatc 540 

tgcctccgaa tgttaatgtg tctaggtgat gtcagtggga gccatgaaga agggagtggg 600 

gagggcagtt gggcttggag gcggcagcgg ctgccaggct acggaggaag acccccttcc 660 

cgactgcggg gcttgcgctc cgggacaagg tggcaggcgc tggaggctgc cgcagcctgc 720 

gtgggtggag gggagctcag ctcggttgtg ggagcaggcg accggcactg gctggatgga 780 

cctggaagcc tcgctgctgc ccactggtcc caatgccagc aacacctctg atggccccga 840 

taacctcact tcggcaggtg agttgactgg gagccctccc tcctctgggc tgtgggtgga 900 

aaatgggaag gtttcacccc tgagccaaac tgcttgggaa actttatcac agttcttggg 960 

gacaagatct gtggtctgct ttgctctgag gggcaggaga aaagggggca atggtccgca 1020 

ggggcagacg ggcaggagca gagcaggggg cgaaggcata ttcagaatgg caaggaaggg 1080 

gggccagccg tgagacagca ggggaaggct cgctgctggg ttccaaagat gcttggcaga 1140 

aaaaattcca ggctggaaaa gcaagcgaga gaagctggag ggtggtatgt gggagacagc 1200 

tgggggctca ctcctgcact gttagcctca gctttttact cccacttgga tgatgaggtc 1260 

tgagacatcc ttactgccac ctgggagagg ccctgggaag ggaagacttc acagagccat 1320 

gaggggatta acttttctgg tgaattaagc ttcctgacat ttccagagct gcggtgccct 1380 

gggattccag ctttgaagga gaaaggaagg aaggaaaaga ggaaaggctt atgtagataa 1440 

tttttccagg ctgctgagct ccaacagaca gtttctgtct ctgcttcact caagaagccc 1500 

aggctcagaa gataccaatc aaggaaatcc ccgctaggaa gcctggggta gggagagctg 1560 

. ctggcttgac cagggcacag ccggcaaaag cctctacaag acagtcaccc acagatatgc 1620 

ccaagaatca gtacacagtt tccaaccaga gatctccaaa atgaaacact cagggctaca 1680 

cataggaaaa gcacgcacac acacacacac acacacacag acacttactt ttgtgtcctt 1740 

ctggctatgc tgacgagttt tcctggtgaa gcccggggct cacagagtaa tctctgcaga 1800 

caactgtggt tcttgcctct ggtgcctgca ggaggcaggc atgttgtgtc cttccaagac 1860 

agatggctca gggcactctg gtaggattca ccaggaaact catggagaag ggaaaaggga 1920 
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caagattagc 
gtcgctggag 
cagccaacgc 
catcaacatc 
ctccacggtc 
cgacatcttc 
catgatccac 
catcacggcc 
cattgaccgc 
tgtggccacc 
gtggctgtat 
gcccaaccca 
cctgcctttt 
agtggccccc 
catcgccatc 
ccagttgtcc 
cttgggctat 
ccgcaaacgc 
caacgctcag 
ccaccctgca 
aaacccaaga 
ataaatacat 
ttcagatatc 
gagcaagaga 
tcttcttccc 
gctggagtga 
cgctcttt 



aacagtgaag 
gctgagcatg 
ttgctccttc 
atcatgcctt 
atcttcgcgg 
atcatcaacc 
cagctcatgg 
atggatgcca 
tacctggcca 
ctggtgatct 
gccagactca 
gacactgacc 
gtggtcatca 
gcctcccagc 
tgtctggtct 
atcagccgcc 
gccaacagct 
ttggtcctgt 
acggctgacg 
cacctccaag 
ccgctcggga 
tccatggggc 
agaaatcccc 
acatgttggt 
aaggcaagag 
gcagccaggg 



ggagggagaa 
ccagcaggat 
tgtccccagg 
cggtgttcgg 
tcgtgaagaa 
tctcggtagt 
gcaatggggt 
atagtcagtt 
ctgtccaccc 
gcctcctgtg 
tccccttccc 
tctactggtt 
cagccgcata 
gcagcatccg 
tctttgtgtg 
cgaccctcac 
gcctcaaccc 
cggtgaagcc 
aggagaggac 
tcagggcacc 
aatgcaggaa 
tcacacgttg 
ttgggggagc 
ttggataacc 
gtggaagggt 
ccatgttgca 



tggtgggaga 
gtcagttctc 
atcacctcct 
caccatctgc 
gtccaagctg 
agatctcctc 
gtggcacttt 
caccagcacc 
catctcttcc 
ggccctctcc 
aggaggtgca 
caccctgtac 
cgtgaggatc 
gctgcggaca 
ctgggcaccc 
ctttgtctac 
ctttgtgtac 
tgcagcccag 
agaaagcaaa 
acaacacgcc 
ggccgggttg 
ctggggaggc 
aggatgagac 
ggttgcacta 
actgactggg 
caaggcctga 



ggattccaga 
agagcaaagc 
cgcacgggga 
ctcctgggca 
cactggtgca 
tttctcctgg 
ggggagacca 
tacatcctga 
acgaagttcc 
ttcatcagca 
gtgggctgcg 
cagtttttcc 
ctgcagcgca 
aagagggtga 
tactatgtgc 
ttatacaatg 
atcgtgctct 
gggcagcttc 
ggcacctgat 
accgggagag 
tgaggggttg 
ctggagtcag 
ctttggatag 
tatctgtgag 
tttgtttaaa 
gagacgggaa 



tgaacggtgg 
ccatgtcaaa 
gcatctccta 
tcatcgggaa 
acaacgtccc 
gcatgccctt 
tgtgcaccct 
ccgccatggc 
ggaagccctc 
tcacccctgt 
gcatacgcct 
tggcctttgc 
tgacgtcctc 
cccgcacagc 
tacagctgac 
cggccatcag 
gtgagacgtt 
gcgctgtcag 
acttcccctg 
atgctgagaa 
ttgcaatgaa 
gtttggggtt 
aacagaagct 
ctctcaaatg 
gtcaggcagg 
agggcccgat 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3488 



<210> 2 

<211> 353 

<212> PRT 

<213> HOMO SAPIENS 



<400> 2 



Met 


Asp 


Leu 


Glu 


Ala 


Ser 


Leu 


Leu 


Pro 


Thr 


Gly Pro 


Asn 


Ala 


Ser 


Asn 


1 








5 










10 








15 




Thr 


Ser 


Asp 


Gly 
20 


Pro 


Asp 


Asn 


Leu 


Thr 
25 


Ser 


Ala Gly 


Ser 


Pro 
30 


Pro 


Arg 


Thr 


Gly 


Ser 
35 


He 


Ser 


Tyr 


He 


Asn 
40 


He 


He 


Met Pro 


Ser 
45 


Val 


Phe 


Gly 


Thr 


lie 
50 


Cys 


Leu 


Leu 


Gly 


He 
55 


He 


Gly 


Asn 


Ser Thr 
60 


Val 


He 


Phe 


Ala 


Val 


Val 


Lys 


Lys 


Ser 


Lys 


Leu 


His 


Trp 


Cys 


Asn Asn 


Val 


Pro 


Asp 


He 


65 










70 










75 








80 


Phe 


He 


He 


Asn 


Leu 
85 


Ser 


Val 


Val 


Asp 


Leu 
90 


Leu Phe 


Leu 


Leu 


Gly 
95 


Met 


Pro 


Phe 


Met 


He 
100 


His 


Gin 


Leu 


Met 


Gly 
105 


Asn 


Gly Val 


Trp 


His 
110 


Phe 


Gly 


Glu 


Thr 


Met 


Cys 


Thr 


Leu 


He 


Thr 


Ala 


Met 


Asp Ala 


Asn 


Ser 


Gin 


Phe 






115 










120 






125 








Thr 


Ser 
130 


Thr 


Tyr 


He 


Leu 


Thr 
135 


Ala 


Met 


Ala 


He Asp 
140 


Arg 


Tyr 


Leu 


Ala 


Thr: 


Val 


His 


Pro 


He 


Ser 


Ser 


Thr 


Lys 


Fhe 


Arg Lys 


Pro 


Ser 


Val 


« "1 _ 


145 










150 










155 








160 


Thr 


Leu 


Val 


He Cys 


Leu 


Leu 


Trp 


Ala 


Leu 


Ser Phe 


He 


Ser 


He 


Thr 










165 










170 








175 




Pro 


Val 


Trp 


Leu Tyr 


Ala 


Arg 


Leu 


He 


Pro 


Phe Pro 


Gly 


Gly 


Ala 


Val 








180 










185 








190 






Gly 


Cys 


Gly 


He Arg 


Leu 


Pro 


Asn 


Pro 


Asp 


Thr Asp 


Leu 


Tyr 


Trp 


Phe 
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195 






9 n n 






205 








Thr 


Leu 


Tyr 


Gin Phe 


Phe 


Leu Ala 


IT lie 




Tif*ij Pro Phe 


Val 


Val 


He 




210 












220 




Val 


Ala 


Thr 


Ala 


Ala Tyr Val Arg 


T T ^ T All 




Arg 


Vi^-t- Thr Ser 


Ser 


225 








230 














240 


Pro 


Ala 


Ser 


Gin Arg 


Ser 


xxe Arg 


Leu 


Arg 


'Vlnir T.\7Q ArtT 
J.X1JL ijyo 


Val 


Thr 


Arg 








245 














255 




Thr 


Ala 


He 


Ala He 


Cys 


Leu Val 


c He 


irlic 




Ala 


Pro 










260 




265 






270 






Tyr 


va± 


Leu 


Gin Leu 


Thr 


Gin Leu 


Ser 


He 


Ser Arg Pro 


Thr 


Leu 


Thr 




275 






280 






285 








Phe 


Val 


Tyr 


Leu Tyr 


Asn 


Ala Ala 


He 


Ser 


Leu Gly Tyr 


Ala 


Asn 


Ser 




290 




295 






300 








Cys 


Leu 


Asn 


Pro Phe 


Val 


Tyr He 


Val 


Leu 


Cys Glu Thr 


Phe 


Arg 


Lys 


305 








310 








315 






320 


Arg 


Leu 


Val 


Leu Ser 


Val 


Lys Pro 


Ala 


Ala 


Gin Gly Gin 


Leu 


Arg 


Ala 






325 








330 






335 




Val 


Ser 


Asn 


Ala Gin Thr Ala Asp 


Glu 


Glu 


Arg Thr Glu 


Ser 


Lys 


Gly 








340 






345 






350 






Thr 

























60 



<210> 3 
<211> 775 
<212> DNA 
<213> HOMO SAPIENS 

<400> 3 

gaggtcctgg ctattgccag catggagtga cctgtgtcac ctctgagtgc caggcaaggg 

ttcagcagct gacgactcag cttctgcagg atgctggcag catagccagc gagatagttg 120 

gaagccgtca gggcacaggg aaggggccga gggtgccctg agtgtgcatg gggggcagcc 180 

ctgctgcagt ccaagccttt gattcccaag ctatgtgcac agtttcctct ggactctgcc 240 

atgtggccca gccacccata cctggaatag gggctaagcc aagctgctct ctcctccaaa 300 

gggaggcagc ctgtgtgctt tgtccgtttg cctttgcaga gacctcgatc ctcacgcaag 360 

gcaagcagca gcccctgtaa gcacacgaga caatcccaag tgtcagtggg aaggagatcc 420 

ctttcctgat ggggctgcct gtgtccagtc cctcccagct tccccagggc cctggggctc 4 80 

tgcaggcatt cagaagtgga agccagccac agcctgggac tgaagaggtt aatgtgcatc 540 

tgcctccgaa tgttaatgtg tctaggtgat gtcagtggga gccatgaaga agggagtggg 600 

gagggcagtt gggcttggag gcggcagcgg ctgccaggct acggaggaag acccccttcc 660 

cgactgcggg gcttgcgctc cgggacaagg tggcaggcgc tggaggctgc cgcagcctgc 720 

gtgggtggag gggagctcag ctcggttgtg ggagcaggcg accggcactg gctgg 775 

<210> 4 

<211> 82 

<212> DNA 

<213> HOMO SAPIENS 

<400> 4 

atggacctgg aagcctcgct gctgcccact ggtcccaatg ccagcaacac ctctgatggc 
cccgataacc tcacttcggc ag 

<210> 5 
<211> 1212 
' <212> DNA 

<213> HOMO SAPIENS 



60 
82 



<400> 5 

gtgagttgac tgggagccct ccctcctctg ggctgtgggt ggaaaatggg aaggtttcac 
ccctgagcca aactgcttgg gaaactttat cacagttctt ggggacaaga tctgtggtct 



60 
120 
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gctttgctct gaggggcagg agaaaagggg gcaatggtcc gcaggggcag acgggcagga 180 

gcagagcagg gggcgaaggc atattcagaa tggcaaggaa ggggggccag ccgtgagaca 24 0 

gcaggggaag gctcgctgct gggttccaaa gatgcttggc agaaaaaatt ccaggctgga 300 

aaagcaagcg agagaagctg gagggtggta tgtgggagac agctgggggc tcactcctgc 360 

actgttagcc tcagcttttt actcccactt ggatgatgag gtctgagaca tccttactgc 420 

cacctgggag aggccctggg aagggaagac ttcacagagc catgagggga ttaacttttc 480 

tggtgaatta agcttcctga catttccaga gctgcggtgc cctgggattc cagctttgaa 540 

ggagaaagga aggaaggaaa agaggaaagg cttatgtaga taatttttcc aggctgctga 600 

gctccaacag acagtttctg tctctgcttc actcaagaag cccaggctca gaagatacca 660 

atcaaggaaa tccccgctag gaagcctggg gtagggagag ctgctggctt gaccagggca 720 

cagccggcaa aagcctctac aagacagtca cccacagata tgcccaagaa tcagtacaca 780 

gtttccaacc agagatctcc aaaatgaaac actcagggct acacatagga aaagcacgca 840 

cacacacaca cacacacaca cagacactta cttttgtgtc cttctggcta tgctgacgag 900 

ttttcctggt gaagcccggg gctcacagag taatctctgc agacaactgt ggttcttgcc 960 

tctggtgcct gcaggaggca ggcatgttgt gtccttccaa gacagatggc tcagggcact 1020 

ctggtaggat tcaccaggaa actcatggag aagggaaaag ggacaagatt agcaacagtg 1080 

aagggaggga gaatggtggg agaggattcc agatgaacgg tgggtcgctg gaggctgagc 1140 

atgccagcag gatgtcagtt ctcagagcaa agcccatgtc aaacagccaa cgcttgctcc 1200 

ttctgtcccc ag 1212 

<210> 6 

<211> 980 

<212> DNA 

<213> HOMO SAPIENS 

<400> 6 

gatcacctcc tcgcacgggg agcatctcct acatcaacat catcatgcct tcggtgttcg 60 

gcaccatctg cctcctgggc atcatcggga actccacggt catcttcgcg gtcgtgaaga 120 

agtccaagct gcactggtgc aacaacgtcc ccgacatctt catcatcaac ctctcggtag 180 

tagatctcct ctttctcctg ggcatgccct tcatgatcca ccagctcatg ggcaatgggg 240 

tgtggcactt tggggagacc atgtgcaccc tcatcacggc catggatgcc aatagtcagt 300 

tcaccagcac ctacatcctg accgccatgg ccattgaccg ctacctggcc actgtccacc 360 

ccatctcttc cacgaagttc cggaagccct ctgtggccac cctggtgatc tgcctcctgt 420 

gggccctctc cttcatcagc atcacccctg tgtggctgta tgccagactc atccccttcc 480 

caggaggtgc agtgggctgc ggcatacgcc tgcccaaccc agacactgac ctctactggt 540 

tcaccctgta ccagtttttc ctggcctttg ccctgccttt tgtggtcatc acagccgcat 600 

acgtgaggat cctgcagcgc atgacgtcct cagtggcccc cgcctcccag cgcagcatcc 660 

ggctgcggac aaagagggtg acccgcacag ccatcgccat ctgtctggtc ttctttgtgt 720 

gctgggcacc ctactatgtg ctacagctga cccagttgtc catcagccgc ccgaccctca 780 

cctttgtcta cttatacaat gcggccatca gcttgggcta tgccaacagc tgcctcaacc 840 

cctttgtgta catcgtgctc tgtgagacgt tccgcaaacg cttggtcctg tcggtgaagc 900 

ctgcagccca ggggcagctt cgcgctgtca gcaacgctca gacggctgac gaggagagga 960 

cagaaagcaa aggcacctga 980 

<210> 7 

<211> 439 

<212> DNA 

<213> HOMO SAPIENS 

<400> 7 

tacttcccct gccaccctgc acacctccaa gtcagggcac cacaacacgc caccgggaga 60 

gatgctgaga aaaacccaag accgctcggg aaatgcagga aggccgggtt gtgaggggtt 120 

gttgcaatga aataaataca ttccatgggg ctcacacgtt gctggggagg cctggagtca 160 

ggtttggggt tttcagatat cagaaatccc cttgggggag caggatgaga cctttggata 240 

gaacagaagc tgagcaagag aacatgttgg tttggataac cggttgcact atatctgtga 300 

gctctcaaat gtcttcttcc caaggcaaga ggtggaaggg tactgactgg gtttgtttaa 360 

agtcaggcag ggctggagtg agcagccagg gccatgttgc acaaggcctg agagacggga 420 

aagggcccga tcgctcttt 439 
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